Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3775 |
Symbol | |
ID | 8139149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 4348014 |
End bp | 4349210 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644871394 |
Product | poly-gamma-glutamate synthesis protein (capsule biosynthesis protein) |
Protein accession | YP_003023552 |
Protein GI | 253702363 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 111 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAAATC CTATATCCAT CTTCATGTGC GGCGATGTCA TGACCGGTAG GGGGATCGAC CAGATACTTC CTCACCCGGT CGATCCGCAC ATCTACGAGT GGTTCGTGAC CGATGCCCGC AGTTATGTGG ATCTTGCAGA AAAGGTCAAC GGCCCGATAC CAAGGCAGGT CTCATGCGAT TACTTCTGGG GTGACGCCCT CGGCTTCTTC CGCAGGCTTC GCCCGGACGT GAAACTGATC AACCTCGAAA CCAGCGTGAC CACATCGGAG GAATTCTGGC CGGGCAAGGG GATCAACTAC CGCATGCACC CGCTCAATTT CCCGGCCATC ACGACCGCCG GGATTGACGT CTGCGCTTTG GCGAACAATC ACGTCATGGA CTGGGGGTAC CAGGGGTTGG AAGAAACGCT TCGGACGCTG GAGAAATCCG GTGTGCGATG CGCCGGGGCG GGGCATGACC TTTCGTCGGC GGCGGAACCT GCCACCGTTG CAGTTTCGGG TAAGGGGAGG GTGCACGTCT TCTCCTTGGG GGATGCCTCC AGCGGCATCC CTTCCGAATG GGGAGCCGGA AATGAACTGG CGGGCGTGAA CCTGCTGCCG GACCTGTCGG ACCGGACGGC GGAGAGGTTG AGAGAGCAGG TGCGGCAGGT GAAGCAGGAG GGGGACGTAG TGGTAGCCTC CATCCACTGG GGAAGCAACT GGGGATTCGA GGTGCCGCGG GAGCAGATCG AATTCGCCCA TCGTCTCATC GACAGCGCCG GGGTCGATGT GATCCATGGC CATTCCTCGC ATCATGTGAA AGGGGTGGAG GTGTACCGTG GCAAGCTGAT CATCTACGGC TGCGGTGATT TCCTGACCGA CTACGAGGGG ATAAGGGGGA AGGAAGCGTA TCGCGGCGAC CTTGGGTTCA TGTACTTTGC TTATCTGGAT GGGGAGACCG GTGCGTTGAA GGAACTGAGG CTGATACCGA CGAAAGTGCG GAAGTTCCAG GTCGTCAGGG CTAGGGGCGC CGACTGGCGC TGGCTGCGTG ACACCATGAA CCGCGAAGGG AAGATGTTGG GGACTGGGGT GGAAGAAGCG GAAGATCGGG TGCTGCTGCT AAGATGGGCA GAACCTGTGA ACCGAAAGCG CAAAAGCCTT TCGCGTGGAG GTGGGGGGAA ATGCGAAGGG CCGCCATGGG CGGCCCTTGT CGGTTGA
|
Protein sequence | MQNPISIFMC GDVMTGRGID QILPHPVDPH IYEWFVTDAR SYVDLAEKVN GPIPRQVSCD YFWGDALGFF RRLRPDVKLI NLETSVTTSE EFWPGKGINY RMHPLNFPAI TTAGIDVCAL ANNHVMDWGY QGLEETLRTL EKSGVRCAGA GHDLSSAAEP ATVAVSGKGR VHVFSLGDAS SGIPSEWGAG NELAGVNLLP DLSDRTAERL REQVRQVKQE GDVVVASIHW GSNWGFEVPR EQIEFAHRLI DSAGVDVIHG HSSHHVKGVE VYRGKLIIYG CGDFLTDYEG IRGKEAYRGD LGFMYFAYLD GETGALKELR LIPTKVRKFQ VVRARGADWR WLRDTMNREG KMLGTGVEEA EDRVLLLRWA EPVNRKRKSL SRGGGGKCEG PPWAALVG
|
| |