Gene GM21_3501 details


Gene Information

Locus tag: GM21_3501
Symbol:
ID: 8138873
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4039581
End bp: 4040573
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 62%
IMG OID: 644871120
Product: glycosyl transferase family 2
Protein accession: YP_003023280
Protein GI: 253702091
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 8.137120000000001e-33
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGAGCCGGC CGCAGAAAGG ACCGGGCGCG CCGTCGCAGG AGGGGGCGCG GATAGCGGTA 
GTGATTCCTT CCTATAAGGT GAAGCAGCAT GTGCTTCAGG TGATCTCTGC CATAGGCCCC
GAAGTCTCCA GCATCGTCGT GGTGGATGAC GCCTGCCCCG ACGGTTCCGG CCGCTACGTT
GAAGAGAACT GCCGCGACCC GCGCGTGCTG GTTTGCTCCC ACACCGAAAA CCGCGGGGTC
GGTGGCGCCA CGCTCACCGG GTATCAGGCG GCTCTGGACC AGGGCGCGGA CATCATCGTC
AAGCTGGACG GCGACGGGCA GATGGACCCG GCCCTCATCC CGAAGCTGGT GCGGCCGATA
GTCGACCAGG TCGCGGACTA CAGCAAGGGG AACCGCTTCT ATTCCGTCGA AGATCTCCAG
CAGATGCCTT TCGCGAGGCT GGTGGGAAAT TCAGTGCTCT CCTTCATGGC CAAGTTCTCC
ACCGGGTATT GGACCATCTT CGACCCCACC AACGGCTTCA CCGCGATCCA CGGCGCCGTC
GCGGCGCTGC TTCCGCTGGA AAAGATCGAA AAGAGGTATT TCTTCGAGTC CGACATGCTG
TTCCGGCTCA ACACCCTGCG CGCGGTGGTG GCCGACGTTC CCATGCGTGC CAGATACGCC
GACGAGAAGA GCAACCTCAG CATCCTCGGG GTCATTCCCG AGTTTCTCAG GAAGCACGCG
GTGAACAGTT GCAAACGGAT TTTCTACAAC TACTACCTGA GAGACTTCAG CGCCGCCTCG
GTAGAAGTGG TGCTCGGCCT CTGCGCCCTT TTGTTCGGGG TCGTCTTCGG TTCGTGGACC
TGGTACGGTT CGATTCGGAC GGGGGTCCCG GCGACCAGCG GGACGGTCAT GCTGGCTGCG
CTCCCGACCA TGCTGGGGAT GCAGCTTTTC CTTGCCTTTC TTTCCTACGA TACGGCCAAC
GCGCCCAAGT ATCCCCTGCA CAGAAGGCTT TAA
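
The length and GC content fields in this record can be cross-checked against the coordinates and the gene sequence. The following is a minimal sketch, not part of the original record: the helper names (clean_sequence, gc_percent, protein_length) are hypothetical, and it assumes an unspliced CDS whose stated length includes the stop codon, which is consistent with 993 bp and 330 aa.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def protein_length(gene_length_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming the length includes the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Coordinates as listed in the record (inclusive on both ends).
start_bp, end_bp = 4039581, 4040573
gene_length = end_bp - start_bp + 1
print(gene_length, protein_length(gene_length))  # 993 330

# First line of the gene sequence, used here only as a short input sample.
first_line = clean_sequence(
    "GTGAGCCGGC CGCAGAAAGG ACCGGGCGCG CCGTCGCAGG AGGGGGCGCG GATAGCGGTA"
)
print(len(first_line), round(gc_percent(first_line), 1))
```

Passing the full 993 bp sequence to gc_percent should give a value close to the 62% listed above, assuming the record rounds to the nearest percent.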
 
Protein sequence
MSRPQKGPGA PSQEGARIAV VIPSYKVKQH VLQVISAIGP EVSSIVVVDD ACPDGSGRYV 
EENCRDPRVL VCSHTENRGV GGATLTGYQA ALDQGADIIV KLDGDGQMDP ALIPKLVRPI
VDQVADYSKG NRFYSVEDLQ QMPFARLVGN SVLSFMAKFS TGYWTIFDPT NGFTAIHGAV
AALLPLEKIE KRYFFESDML FRLNTLRAVV ADVPMRARYA DEKSNLSILG VIPEFLRKHA
VNSCKRIFYN YYLRDFSAAS VEVVLGLCAL LFGVVFGSWT WYGSIRTGVP ATSGTVMLAA
LPTMLGMQLF LAFLSYDTAN APKYPLHRRL
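
The protein sequence can be reproduced from the gene sequence with translation table 11. The sketch below is illustrative only and is not the database's own method: translate_cds is a hypothetical helper, the codon table is the standard genetic code (which table 11 shares for amino acid assignments), and the start codon set is an assumption based on the NCBI definition of table 11. An alternative start codon such as GTG, which opens this gene, is translated as M.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order; table 11 uses the same amino acid assignments.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed start codons for table 11 (bacterial, archaeal and plant plastid code).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(raw: str) -> str:
    """Translate a CDS, reading a recognized first codon as M and stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is always read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate_cds("GTGAGCCGGC CGCAGAAAGG A"))  # MSRPQKG
```

The output matches the first seven residues of the protein sequence. Translating the complete gene sequence the same way should yield the 330 aa protein listed in the record.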