Gene Caul_2072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2072 
Symbol 
ID5899527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2214033 
End bp2217044 
Gene Length3012 bp 
Protein Length1003 aa 
Translation table11 
GC content67% 
IMG OID641562561 
ProductSMP-30/gluconolaconase/LRE domain-containing protein 
Protein accessionYP_001683698 
Protein GI167646035 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3386] Gluconolactonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.437854 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCGG TCGTCCTGCT GCTGGCGATG AGCACTCCGG CGACGGCCTC GGTCTCCACC 
TTCACGGCGG CGCCCGACGA TCCGCGCGCG GTCGTGGCGC GGGGCAAGGG CGACGGTCGC
GCGGACGATA GCGCCGCCCT CCAGAGCGCG ATCGACGCCG CCGCGGCCAA GCCGGGCGGC
GGCTTGGTGT TCTTGCCCTC GGGCCGCTAC CGGATCAGCA AGACGATCTT CCTCTGGCCG
GGCGTGCGTG TGTTCGGGGT CGGCGCGACC CGTCCGCTGA TCCTGCTCGC GGACAACGCG
CCCGGGTTCC AGGAAGGTCT GGCGAACATG GTGATCTTCG CAGGGGCCAA GCCGGACCCG
TCGCGTCGCG TGCCGTTCCC CCCGCCCGGA AGCGTTCCCT TCAACAAGGA TATCGCCGAC
GCCACTCCCA ACAGCTTCTA TTCGGCGATG AGCAACATCG ACTTCAAGAT CGGGCGGGGC
AATCCGGCGG CGACGGCGAT TCGCTTCCAT GCCGCCCAGC ACGCCTATCT CAGCCACATG
GATTTCGATC TCGGCTCGGG CCTGGCGGGC CTGTACCAGG TCGCCAATGA GGCGGAGGAT
CTCCACTTCA AGGGCGGCCG CTACGGCATT CTCACCGAAA AGCCCTCGCC AGCCTGGGGT
TTCGTGCTGC TGGACTCGAC CTTCGAAGGG CAGCGCGACG CCGCGATCCG CGAGCATGAA
GCCGGCCTCA CCCTGGTGAA TGTGGCGATG CGCGACACGC CCGTGGGGAT CGAGATCGAC
CGCGGCTATG GCGACTGGCT GTGGGGCAAG GACGTCCGTT TCGAGAACGT CTCGAAGGCC
GGCGTCGTCA TCTCCAACGA GGCCAGCGTC TATACCCAGA TCGGCTTCGA GAACGCGGTG
GCGGTCAATA CGCCAGTGTT CGCCCGCTTT CGCGACAGCG GCAAGATCGT GGCCGGCAAG
GGCCGCGTCT ACGGCGTCGC GTCCTTCACC CACGGCTTGA CGTTGCCAGG CCCTGGCCAG
ATGGGCGAGT ACAGGACGGA GATGAAGGCC GACGCCCTTG GCGCGCCGCC GAAGCCTCGC
GCGCCGGCGA TCCGGGCTCT GCCCGCCATG GCCGAATGGG CCAATGTGCG TACGCTGGGC
GCCAAGGGCG ACAACAGCAC CAACGACACC GCCGCGATCC AGCGCGCGAT CGACACCCAT
CGGGTCGTCT ATTTCCCGGC CGGCTTCTAT GTGGTCGCCG ACACGCTGCG CTTGCGGCCG
GACACGGTCC TGATCGGCCT GCATCCAAGC CTGACGCAGA TCGTCTTGCC CGATCGCACC
CCTGGCTATC AGGGCGTCGG CGCCCCCAAG GCGCTGGTCG AGTCCGCCGC CAAGGGCGAC
AATATCGTCT CCGGGCTGGG TCTCAACACG GGCGGCGTCA ATCCCCGCGC CACGGCGCTG
CTGTGGACGG CTGGCGCCGA TTCGATGGTC AATGACGTCA AGTTCCAGGG GGGCCACGGC
ACCGACCTCT ATGACGGGTC GCGGTTCAGC CCCTACAACG CCAACGGGAC GGCCGACGCC
GATCCGGCCA AGCGCTGGGC CGGCCAGTAC CCCAGCCTAT GGGTGACGAA CGGCGGCGGC
GGCACGTTCG CCAACCTCTG GAGCCCCAGC ACCTACGCCT ATTCCGGTAT CTATATCTCC
GACACCCAAA CGCCTGGTCA TATCTACCAG GCCTCGGTCG AGCACCATGT GCGCACCGAG
ATCAGCCTCA ATCGGGTGGC CCACTGGGAG TTGCTGGCGC CGCAGACCGA GGAGGAGGCG
GGGGAGGGCG AGGACACCGT CTCGCTGGAA ATCCGCGACT CGCACGATAT TCTGGTCGCC
AACTATCACG CCTATCGCGT GACCCGGACG CGCAAGCCGG CTGTGGCGGC GGCCAAGGTC
TACAATTCCC ATGACATCCG CTTCCGCAAC GTGGCGGTGA ACGCCGAAAG CGGGGTGGGG
CTCTGCGACG AGAACGGCTG CGCCACCTTC CTGCGTCCGA GCAAGTATCC GTACGAGAAC
GCCATCCAGG ACGTCACCAG CGGACTGGAG GTGCGCGAGC GGCAGTTCGC GGTTCTGGAC
CTGGCGGCCA ATCCGCCACC GGTCGCGCCG TCGACCTTCC CCGCCGGCGC CAAGGTGGAA
AAACTGGGCG ACGGCTTCTA TTCGTTGGGC GGCGGCGCCG TCGATGCGGC CGGGACGCTC
TACTTCACCG ACCGCCGTCA CCAGCGCATC TACAGCTGGT CCGAGGCGCG CAAGCTGTCG
ATCGTGCGCG ACAACACGCT GGATCCCATC AACCTGGCGG TCGATGGGTC CGGCAACTTG
CTGGTGCTGT CGTCGGATGG CCGCGACGGG TCGGTCTACA GCTTCAAGCC GGGATCCCCC
GACACCGAGG TTACGGTGAT CACGCCGACC CCGGCGGCTG ACCACCCGGA CGCGGCGACC
GTACTGCCCG TCAACTGGTG GAACAACGGC GAGTTCAAGG ACCAGCTGGA TCTCCAGACC
TATCAGTTCA CCACCTTGGA GCAGATGTTC GCCCGCGACA TGGCGCTCCC GAAGCCCAAG
GAATACATTT CGCCCGACGG CAGCCTGGCC TTGCCGGCGT ATCGCGTCTT CCAGCAGGGT
CCGCCCGACT ACCGGGCCAT GCGATTTTCT GATTCGCTCG ACACCTACGG CTTCACGACG
GCCAAGCCTG GCCAGCGGGT GTTCATCACC AACGGCTCGG AAAACAAGAC CTACAGCGGC
CTGGTCGGAA GGGGCGGGGC GATCACCGAT CTGAAGCCCT TCGCCAACCG GGGCGGCGAG
AGCGTCGCCA CGGACGGGGA GGGCCGCGTC TACGTCGCCA ATGGCCAGGT GTTCGTCTAC
GCCCCGGACG GCCGGGAACT GGGGCGCGTG GATATTCCCG AACGACCGCT GCAGTTGGTC
TTCGGCGGCC CTGACAGGCG GACGCTGTTT GTGCTGACCC ACCACGCGCT CTACGCGGTG
CGGCGCCAAT AG
 
Protein sequence
MAAVVLLLAM STPATASVST FTAAPDDPRA VVARGKGDGR ADDSAALQSA IDAAAAKPGG 
GLVFLPSGRY RISKTIFLWP GVRVFGVGAT RPLILLADNA PGFQEGLANM VIFAGAKPDP
SRRVPFPPPG SVPFNKDIAD ATPNSFYSAM SNIDFKIGRG NPAATAIRFH AAQHAYLSHM
DFDLGSGLAG LYQVANEAED LHFKGGRYGI LTEKPSPAWG FVLLDSTFEG QRDAAIREHE
AGLTLVNVAM RDTPVGIEID RGYGDWLWGK DVRFENVSKA GVVISNEASV YTQIGFENAV
AVNTPVFARF RDSGKIVAGK GRVYGVASFT HGLTLPGPGQ MGEYRTEMKA DALGAPPKPR
APAIRALPAM AEWANVRTLG AKGDNSTNDT AAIQRAIDTH RVVYFPAGFY VVADTLRLRP
DTVLIGLHPS LTQIVLPDRT PGYQGVGAPK ALVESAAKGD NIVSGLGLNT GGVNPRATAL
LWTAGADSMV NDVKFQGGHG TDLYDGSRFS PYNANGTADA DPAKRWAGQY PSLWVTNGGG
GTFANLWSPS TYAYSGIYIS DTQTPGHIYQ ASVEHHVRTE ISLNRVAHWE LLAPQTEEEA
GEGEDTVSLE IRDSHDILVA NYHAYRVTRT RKPAVAAAKV YNSHDIRFRN VAVNAESGVG
LCDENGCATF LRPSKYPYEN AIQDVTSGLE VRERQFAVLD LAANPPPVAP STFPAGAKVE
KLGDGFYSLG GGAVDAAGTL YFTDRRHQRI YSWSEARKLS IVRDNTLDPI NLAVDGSGNL
LVLSSDGRDG SVYSFKPGSP DTEVTVITPT PAADHPDAAT VLPVNWWNNG EFKDQLDLQT
YQFTTLEQMF ARDMALPKPK EYISPDGSLA LPAYRVFQQG PPDYRAMRFS DSLDTYGFTT
AKPGQRVFIT NGSENKTYSG LVGRGGAITD LKPFANRGGE SVATDGEGRV YVANGQVFVY
APDGRELGRV DIPERPLQLV FGGPDRRTLF VLTHHALYAV RRQ