Gene GM21_3775 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3775 
Symbol 
ID8139149 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4348014 
End bp4349210 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content60% 
IMG OID644871394 
Productpoly-gamma-glutamate synthesis protein (capsule biosynthesis protein) 
Protein accessionYP_003023552 
Protein GI253702363 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2843] Putative enzyme of poly-gamma-glutamate biosynthesis (capsule formation) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones111 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAAATC CTATATCCAT CTTCATGTGC GGCGATGTCA TGACCGGTAG GGGGATCGAC 
CAGATACTTC CTCACCCGGT CGATCCGCAC ATCTACGAGT GGTTCGTGAC CGATGCCCGC
AGTTATGTGG ATCTTGCAGA AAAGGTCAAC GGCCCGATAC CAAGGCAGGT CTCATGCGAT
TACTTCTGGG GTGACGCCCT CGGCTTCTTC CGCAGGCTTC GCCCGGACGT GAAACTGATC
AACCTCGAAA CCAGCGTGAC CACATCGGAG GAATTCTGGC CGGGCAAGGG GATCAACTAC
CGCATGCACC CGCTCAATTT CCCGGCCATC ACGACCGCCG GGATTGACGT CTGCGCTTTG
GCGAACAATC ACGTCATGGA CTGGGGGTAC CAGGGGTTGG AAGAAACGCT TCGGACGCTG
GAGAAATCCG GTGTGCGATG CGCCGGGGCG GGGCATGACC TTTCGTCGGC GGCGGAACCT
GCCACCGTTG CAGTTTCGGG TAAGGGGAGG GTGCACGTCT TCTCCTTGGG GGATGCCTCC
AGCGGCATCC CTTCCGAATG GGGAGCCGGA AATGAACTGG CGGGCGTGAA CCTGCTGCCG
GACCTGTCGG ACCGGACGGC GGAGAGGTTG AGAGAGCAGG TGCGGCAGGT GAAGCAGGAG
GGGGACGTAG TGGTAGCCTC CATCCACTGG GGAAGCAACT GGGGATTCGA GGTGCCGCGG
GAGCAGATCG AATTCGCCCA TCGTCTCATC GACAGCGCCG GGGTCGATGT GATCCATGGC
CATTCCTCGC ATCATGTGAA AGGGGTGGAG GTGTACCGTG GCAAGCTGAT CATCTACGGC
TGCGGTGATT TCCTGACCGA CTACGAGGGG ATAAGGGGGA AGGAAGCGTA TCGCGGCGAC
CTTGGGTTCA TGTACTTTGC TTATCTGGAT GGGGAGACCG GTGCGTTGAA GGAACTGAGG
CTGATACCGA CGAAAGTGCG GAAGTTCCAG GTCGTCAGGG CTAGGGGCGC CGACTGGCGC
TGGCTGCGTG ACACCATGAA CCGCGAAGGG AAGATGTTGG GGACTGGGGT GGAAGAAGCG
GAAGATCGGG TGCTGCTGCT AAGATGGGCA GAACCTGTGA ACCGAAAGCG CAAAAGCCTT
TCGCGTGGAG GTGGGGGGAA ATGCGAAGGG CCGCCATGGG CGGCCCTTGT CGGTTGA
 
Protein sequence
MQNPISIFMC GDVMTGRGID QILPHPVDPH IYEWFVTDAR SYVDLAEKVN GPIPRQVSCD 
YFWGDALGFF RRLRPDVKLI NLETSVTTSE EFWPGKGINY RMHPLNFPAI TTAGIDVCAL
ANNHVMDWGY QGLEETLRTL EKSGVRCAGA GHDLSSAAEP ATVAVSGKGR VHVFSLGDAS
SGIPSEWGAG NELAGVNLLP DLSDRTAERL REQVRQVKQE GDVVVASIHW GSNWGFEVPR
EQIEFAHRLI DSAGVDVIHG HSSHHVKGVE VYRGKLIIYG CGDFLTDYEG IRGKEAYRGD
LGFMYFAYLD GETGALKELR LIPTKVRKFQ VVRARGADWR WLRDTMNREG KMLGTGVEEA
EDRVLLLRWA EPVNRKRKSL SRGGGGKCEG PPWAALVG