Gene Information |
Locus tag | Caul_2072 |
Symbol | |
ID | 5899527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2214033 |
End bp | 2217044 |
Gene Length | 3012 bp |
Protein Length | 1003 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641562561 |
Product | SMP-30/gluconolactonase/LRE domain-containing protein |
Protein accession | YP_001683698 |
Protein GI | 167646035 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3386] Gluconolactonase |
TIGRFAM ID | |
| |
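The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2214033, 2217044)
print(length)                     # 3012
print(protein_length_aa(length))  # 1003
```

Both values agree with the Gene Length (3012 bp) and Protein Length (1003 aa) fields in the record.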
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.437854 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGGCGG TCGTCCTGCT GCTGGCGATG AGCACTCCGG CGACGGCCTC GGTCTCCACC TTCACGGCGG CGCCCGACGA TCCGCGCGCG GTCGTGGCGC GGGGCAAGGG CGACGGTCGC GCGGACGATA GCGCCGCCCT CCAGAGCGCG ATCGACGCCG CCGCGGCCAA GCCGGGCGGC GGCTTGGTGT TCTTGCCCTC GGGCCGCTAC CGGATCAGCA AGACGATCTT CCTCTGGCCG GGCGTGCGTG TGTTCGGGGT CGGCGCGACC CGTCCGCTGA TCCTGCTCGC GGACAACGCG CCCGGGTTCC AGGAAGGTCT GGCGAACATG GTGATCTTCG CAGGGGCCAA GCCGGACCCG TCGCGTCGCG TGCCGTTCCC CCCGCCCGGA AGCGTTCCCT TCAACAAGGA TATCGCCGAC GCCACTCCCA ACAGCTTCTA TTCGGCGATG AGCAACATCG ACTTCAAGAT CGGGCGGGGC AATCCGGCGG CGACGGCGAT TCGCTTCCAT GCCGCCCAGC ACGCCTATCT CAGCCACATG GATTTCGATC TCGGCTCGGG CCTGGCGGGC CTGTACCAGG TCGCCAATGA GGCGGAGGAT CTCCACTTCA AGGGCGGCCG CTACGGCATT CTCACCGAAA AGCCCTCGCC AGCCTGGGGT TTCGTGCTGC TGGACTCGAC CTTCGAAGGG CAGCGCGACG CCGCGATCCG CGAGCATGAA GCCGGCCTCA CCCTGGTGAA TGTGGCGATG CGCGACACGC CCGTGGGGAT CGAGATCGAC CGCGGCTATG GCGACTGGCT GTGGGGCAAG GACGTCCGTT TCGAGAACGT CTCGAAGGCC GGCGTCGTCA TCTCCAACGA GGCCAGCGTC TATACCCAGA TCGGCTTCGA GAACGCGGTG GCGGTCAATA CGCCAGTGTT CGCCCGCTTT CGCGACAGCG GCAAGATCGT GGCCGGCAAG GGCCGCGTCT ACGGCGTCGC GTCCTTCACC CACGGCTTGA CGTTGCCAGG CCCTGGCCAG ATGGGCGAGT ACAGGACGGA GATGAAGGCC GACGCCCTTG GCGCGCCGCC GAAGCCTCGC GCGCCGGCGA TCCGGGCTCT GCCCGCCATG GCCGAATGGG CCAATGTGCG TACGCTGGGC GCCAAGGGCG ACAACAGCAC CAACGACACC GCCGCGATCC AGCGCGCGAT CGACACCCAT CGGGTCGTCT ATTTCCCGGC CGGCTTCTAT GTGGTCGCCG ACACGCTGCG CTTGCGGCCG GACACGGTCC TGATCGGCCT GCATCCAAGC CTGACGCAGA TCGTCTTGCC CGATCGCACC CCTGGCTATC AGGGCGTCGG CGCCCCCAAG GCGCTGGTCG AGTCCGCCGC CAAGGGCGAC AATATCGTCT CCGGGCTGGG TCTCAACACG GGCGGCGTCA ATCCCCGCGC CACGGCGCTG CTGTGGACGG CTGGCGCCGA TTCGATGGTC AATGACGTCA AGTTCCAGGG GGGCCACGGC ACCGACCTCT ATGACGGGTC GCGGTTCAGC CCCTACAACG CCAACGGGAC GGCCGACGCC GATCCGGCCA AGCGCTGGGC CGGCCAGTAC CCCAGCCTAT GGGTGACGAA CGGCGGCGGC GGCACGTTCG CCAACCTCTG GAGCCCCAGC ACCTACGCCT ATTCCGGTAT CTATATCTCC GACACCCAAA CGCCTGGTCA TATCTACCAG GCCTCGGTCG AGCACCATGT GCGCACCGAG ATCAGCCTCA ATCGGGTGGC CCACTGGGAG TTGCTGGCGC CGCAGACCGA GGAGGAGGCG 
GGGGAGGGCG AGGACACCGT CTCGCTGGAA ATCCGCGACT CGCACGATAT TCTGGTCGCC AACTATCACG CCTATCGCGT GACCCGGACG CGCAAGCCGG CTGTGGCGGC GGCCAAGGTC TACAATTCCC ATGACATCCG CTTCCGCAAC GTGGCGGTGA ACGCCGAAAG CGGGGTGGGG CTCTGCGACG AGAACGGCTG CGCCACCTTC CTGCGTCCGA GCAAGTATCC GTACGAGAAC GCCATCCAGG ACGTCACCAG CGGACTGGAG GTGCGCGAGC GGCAGTTCGC GGTTCTGGAC CTGGCGGCCA ATCCGCCACC GGTCGCGCCG TCGACCTTCC CCGCCGGCGC CAAGGTGGAA AAACTGGGCG ACGGCTTCTA TTCGTTGGGC GGCGGCGCCG TCGATGCGGC CGGGACGCTC TACTTCACCG ACCGCCGTCA CCAGCGCATC TACAGCTGGT CCGAGGCGCG CAAGCTGTCG ATCGTGCGCG ACAACACGCT GGATCCCATC AACCTGGCGG TCGATGGGTC CGGCAACTTG CTGGTGCTGT CGTCGGATGG CCGCGACGGG TCGGTCTACA GCTTCAAGCC GGGATCCCCC GACACCGAGG TTACGGTGAT CACGCCGACC CCGGCGGCTG ACCACCCGGA CGCGGCGACC GTACTGCCCG TCAACTGGTG GAACAACGGC GAGTTCAAGG ACCAGCTGGA TCTCCAGACC TATCAGTTCA CCACCTTGGA GCAGATGTTC GCCCGCGACA TGGCGCTCCC GAAGCCCAAG GAATACATTT CGCCCGACGG CAGCCTGGCC TTGCCGGCGT ATCGCGTCTT CCAGCAGGGT CCGCCCGACT ACCGGGCCAT GCGATTTTCT GATTCGCTCG ACACCTACGG CTTCACGACG GCCAAGCCTG GCCAGCGGGT GTTCATCACC AACGGCTCGG AAAACAAGAC CTACAGCGGC CTGGTCGGAA GGGGCGGGGC GATCACCGAT CTGAAGCCCT TCGCCAACCG GGGCGGCGAG AGCGTCGCCA CGGACGGGGA GGGCCGCGTC TACGTCGCCA ATGGCCAGGT GTTCGTCTAC GCCCCGGACG GCCGGGAACT GGGGCGCGTG GATATTCCCG AACGACCGCT GCAGTTGGTC TTCGGCGGCC CTGACAGGCG GACGCTGTTT GTGCTGACCC ACCACGCGCT CTACGCGGTG CGGCGCCAAT AG
|
Protein sequence | MAAVVLLLAM STPATASVST FTAAPDDPRA VVARGKGDGR ADDSAALQSA IDAAAAKPGG GLVFLPSGRY RISKTIFLWP GVRVFGVGAT RPLILLADNA PGFQEGLANM VIFAGAKPDP SRRVPFPPPG SVPFNKDIAD ATPNSFYSAM SNIDFKIGRG NPAATAIRFH AAQHAYLSHM DFDLGSGLAG LYQVANEAED LHFKGGRYGI LTEKPSPAWG FVLLDSTFEG QRDAAIREHE AGLTLVNVAM RDTPVGIEID RGYGDWLWGK DVRFENVSKA GVVISNEASV YTQIGFENAV AVNTPVFARF RDSGKIVAGK GRVYGVASFT HGLTLPGPGQ MGEYRTEMKA DALGAPPKPR APAIRALPAM AEWANVRTLG AKGDNSTNDT AAIQRAIDTH RVVYFPAGFY VVADTLRLRP DTVLIGLHPS LTQIVLPDRT PGYQGVGAPK ALVESAAKGD NIVSGLGLNT GGVNPRATAL LWTAGADSMV NDVKFQGGHG TDLYDGSRFS PYNANGTADA DPAKRWAGQY PSLWVTNGGG GTFANLWSPS TYAYSGIYIS DTQTPGHIYQ ASVEHHVRTE ISLNRVAHWE LLAPQTEEEA GEGEDTVSLE IRDSHDILVA NYHAYRVTRT RKPAVAAAKV YNSHDIRFRN VAVNAESGVG LCDENGCATF LRPSKYPYEN AIQDVTSGLE VRERQFAVLD LAANPPPVAP STFPAGAKVE KLGDGFYSLG GGAVDAAGTL YFTDRRHQRI YSWSEARKLS IVRDNTLDPI NLAVDGSGNL LVLSSDGRDG SVYSFKPGSP DTEVTVITPT PAADHPDAAT VLPVNWWNNG EFKDQLDLQT YQFTTLEQMF ARDMALPKPK EYISPDGSLA LPAYRVFQQG PPDYRAMRFS DSLDTYGFTT AKPGQRVFIT NGSENKTYSG LVGRGGAITD LKPFANRGGE SVATDGEGRV YVANGQVFVY APDGRELGRV DIPERPLQLV FGGPDRRTLF VLTHHALYAV RRQ
|
| |
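As a sanity check on the Sequence section, the gene sequence can be translated and compared with the listed protein. The following is a minimal sketch, not the pipeline that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The function name `translate` is an illustrative choice, and only the first 30 and last 12 bases of the gene are used here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Amino acid assignments are the same for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0; spaces between 10-base blocks are ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGGCGGCGG TCGTCCTGCT GCTGGCGATG"))  # MAAVVLLLAM

# Last 12 bases of the gene sequence, including the stop codon.
print(translate("CGGCGCCAATAG"))  # RRQ*
```

The two outputs match the beginning (MAAVVLLLAM) and end (RRQ) of the protein sequence shown above. Translating the full 3012 bp sequence the same way would be expected to give the 1003 aa protein followed by a stop.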