Gene Gmet_2029 details

Gene Information

Locus tag: Gmet_2029
Symbol:
ID: 3740686
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter metallireducens GS-15
Kingdom: Bacteria
Replicon accession: NC_007517
Strand:
Start bp: 2270200
End bp: 2271756
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 52%
IMG OID: 637779323
Product: lipopolysaccharide biosynthesis
Protein accession: YP_384983
Protein GI: 78223236
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily
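
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; the record does not state either convention, but both are consistent with the values given. The function names are illustrative only.

```python
# Values copied from the gene record.
START_BP = 2270200
END_BP = 2271756
GENE_LENGTH_BP = 1557
PROTEIN_LENGTH_AA = 518


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1557
print(coding_codons(GENE_LENGTH_BP))   # 518
```

Under these assumptions, 2271756 - 2270200 + 1 = 1557 bp, and 1557 / 3 = 519 codons, of which 518 encode amino acids and one is the stop codon.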


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value: 0.911023
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.105361
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTATCAA AGACGGAAGA GTTGAATAAA TACCTGAAAA TGCTCATGAA GAGGAGGTAT 
CTCTTTATTG TGGTATCTCT TGTGGTGATG TCTGTTATTG CCTGGGGAAG CTTTTTCCTC
CCCAAGAAAT ATGAGGCATC GAGCACGGTA TTCATCGAAA AGAGCGTCAT CAAGGATCTG
GTGAAGGGGA TCACCTTTAC CCCTTCTGTC GAGGACAAGG TTCGCATCCT GCGGTATGCC
ATGCTCAGCC GCACTTTCGT GACCAATGTT CTGAAATCAC TTGATGCTGA TACGAAGGTA
AAGAACGATA AAGAGATGGA AGGTCTTGTT GAAGACTTCC AGAAAAGGAC GCAAATAAGC
ATCAAGGGTA ATGACCTTTT TATCGTCAGC ATCCGGGACA AGGATCCGAA GCTCGCTACT
GACTACGTCA ACACGCTTGT CAGAAAGTAT GTGGAGGAAA ACGTCTCCAG CAAGCGGCAG
GATTCCTTTG GCGCTGATCG GTTCATCTCC GAGCAGCTAA AAACCTTCAA GGACAAGCTT
GACGAGTCCG AGAACAAGAT CGTTGCTTTT CGTCAGAAAC GGGGCGTGTC GGTGGGGATT
GACGAGGCGC TTCTCGTTAA TGACATTCGC CAATATCAGG GAGAACTCGA CTCAATGCGG
ATCAAACGGA ATGAGTTGAC CGCAACACGG GATGCTCTCA GGCGCCAGCT CAAGAGCATC
AAGCCCACCA CCGTCGCTCT CTCGTCCCGG GAGAACTCGA GCGAGGTTGA GATGCTCGAG
CGCAGGCTCA AACAGCTCTC TGCCAATTAC ACCGACAACT ATCCCGAGGT AATTCGCATC
AAGAGTATCA TCGCATCGCT CAAGAAAAAG CAGGAGCCGG GCCATCAGGC TGATACAGGG
GCGAAAGAGG AATTCAGCAC GGCCAACCCC GTGTACCAGA ATCTCGAGCA GCAGTTGTAC
CAGGTTGAGG CAGAACTTGA GGCGGTCAAC GCCAAACAGC GCCAGCTCCA TGCAACCATA
GGCGGCAAGG AACATGAACT TCGAAACGTT CCGGCTGACC AGAAGACACT GACTGACCTC
ATCAAGGAGC GCGACGCCAA TCGGCAGTTG TACGAGCAGC TTCTCACCCG GCAGGGACAG
GCTCAACTCA CGAAGGAGAT GGAGGTAGAG GACAAGGCCA CGACGTTCAG GGTTGTGGAC
CCGGCCATCG TGCCGATGAA ACCGGTCAGC CCCGACCGGG TCAAGATGAT CATTATGGGC
ATCATCATGG GATTCGTTGC CGGCGCCGCC TCAGTCTTTG TCATGGAGAT GTTCGACTCC
TCCGTCAAGG ATGTCACCTC TCTCAAGAAG CTTGGTTTTG AGGTGCTCGC AGTAATCCCG
ACCATTTTCA ACCAGGAAGA AGCAAGCAAG GTAGCGAAGA AAGATCGAAA GATATATCTG
GTCGCGGGCT GTTACTTCGC TCTCATCTGC TTGATGCTTA CCCATGAATT GCTGGGATTG
ACTCTGATCG AGAAGGTCCT CACCAAACTG GGGCTTGATC AGTTCATCAT GAGCTGA
 
Protein sequence
MVSKTEELNK YLKMLMKRRY LFIVVSLVVM SVIAWGSFFL PKKYEASSTV FIEKSVIKDL 
VKGITFTPSV EDKVRILRYA MLSRTFVTNV LKSLDADTKV KNDKEMEGLV EDFQKRTQIS
IKGNDLFIVS IRDKDPKLAT DYVNTLVRKY VEENVSSKRQ DSFGADRFIS EQLKTFKDKL
DESENKIVAF RQKRGVSVGI DEALLVNDIR QYQGELDSMR IKRNELTATR DALRRQLKSI
KPTTVALSSR ENSSEVEMLE RRLKQLSANY TDNYPEVIRI KSIIASLKKK QEPGHQADTG
AKEEFSTANP VYQNLEQQLY QVEAELEAVN AKQRQLHATI GGKEHELRNV PADQKTLTDL
IKERDANRQL YEQLLTRQGQ AQLTKEMEVE DKATTFRVVD PAIVPMKPVS PDRVKMIIMG
IIMGFVAGAA SVFVMEMFDS SVKDVTSLKK LGFEVLAVIP TIFNQEEASK VAKKDRKIYL
VAGCYFALIC LMLTHELLGL TLIEKVLTKL GLDQFIMS
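
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline: it builds the standard codon table (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the start codons it permits), strips the spacing used in the listing, and translates until the first stop codon. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Standard codon table, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied with the listing's spacing.
first_block = "ATGGTATCAA AGACGGAAGA GTTGAATAAA"
print(translate(first_block))  # MVSKTEELNK
```

Pasting the full gene sequence into `translate` should yield the 518-residue protein sequence, and `gc_fraction` on the same input can be compared against the GC content reported in the record.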