Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3401 |
Symbol | |
ID | 8138768 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3932574 |
End bp | 3933692 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644871018 |
Product | CDP-glucose 4,6-dehydratase |
Protein accession | YP_003023183 |
Protein GI | 253701994 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | [TIGR02622] CDP-glucose 4,6-dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 136 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTCGA TGGGCAATAA AAAACAGCAC CACTTCGATC CCGAATTCTT CCGCGGCAGG CGGGTCCTGG TAACCGGCCA TACCGGTTTC AAGGGGAGCT GGCTCTCTCT TTGGCTGCAC CGTCTGGGGG CGCAGGTGAC CGGCTACGCG CTCTCTCCGC CGACCGACCC GAGCCTCTTC GAGCTGGCCG GCTTGGCGGA GCTGGTGCAC TCGGTGACCG GCGACGTGCG CGACCAGGCG GCGCTTACAA CTGCCGTGAA GGACGCTGCC CCGGAGATCG TGATCCACAT GGCGGCGCAG CCACTGGTGC GCGAGTCTTA CCTGAACCCG GTGGAGACCT ACTCGACCAA CGTGATGGGA ACGGTGCACC TCTTGGAGGC GGTGAGAAAC TCTCCTGGCG TCAAGGCCGT CGTGAACGTC ACCACCGACA AGTGCTACGA GAACCGCGAG TGGGCCTGGG GGTACCGCGA GAACGAGCCG ATGGGGGGGT ACGATCCCTA CTCCAGCAGC AAGGGGTGCT CGGAACTCGT CACCGCCGCC TATCGCAACT CCTACTTCAA CGAGACCCGC TATGCCGACC ACGGCGTGGC GCTCGCCTCG GCCCGCGCCG GAAACGTCAT CGGCGGCGGC GACTGGGCGG GGGACCGCCT CATCCCCGAC TGCGTTGCGG CCCTTTTGAA GCACGAGCCT GTGCGGATCA GAAACCCGCA CGCCATACGC CCCTGGCAGC ACGTGCTGGA GCCCCTGTCC GGCTACCTGA CCCTGGCCCA GAGGCTGTAC CAGGAGGGGC CGCGCTACGC CGGCGCCTGG AACTTCGGGC CCGGAGACGA CGATGCCCGC GAAGTCGAAT GGATCGTTAA GAGGATGTGC AGCCGCTGGC AGGGCGAGGC CCGCTACGAG GTGGACCAGG GGGAGCATCC GCACGAGGCG CACTACCTGA AGCTCGACTG CTCCAAGGCC AAGGCGAAGC TGGGCTGGAG TCCGCGCTGG AGCCTGGAGA CTGCCATCGA GAAGATCATC GACTGGTCCC TCGCCTATCA AAGAGGCGAC GAGCTGCGCG CGGTTTGCCT GCAGCAGATC GACTCGTACT CTGCTGCGGG CGACGGTAAG CGCGCATGA
|
Protein sequence | MISMGNKKQH HFDPEFFRGR RVLVTGHTGF KGSWLSLWLH RLGAQVTGYA LSPPTDPSLF ELAGLAELVH SVTGDVRDQA ALTTAVKDAA PEIVIHMAAQ PLVRESYLNP VETYSTNVMG TVHLLEAVRN SPGVKAVVNV TTDKCYENRE WAWGYRENEP MGGYDPYSSS KGCSELVTAA YRNSYFNETR YADHGVALAS ARAGNVIGGG DWAGDRLIPD CVAALLKHEP VRIRNPHAIR PWQHVLEPLS GYLTLAQRLY QEGPRYAGAW NFGPGDDDAR EVEWIVKRMC SRWQGEARYE VDQGEHPHEA HYLKLDCSKA KAKLGWSPRW SLETAIEKII DWSLAYQRGD ELRAVCLQQI DSYSAAGDGK RA
|
| |