Gene GM21_3505 details

Gene Information

Locus tag: GM21_3505
Symbol:
ID: 8138877
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4046018
End bp: 4047001
Gene Length: 984 bp
Protein Length: 327 aa
Translation table: 11
GC content: 49%
IMG OID: 644871124
Product: glycosyl transferase family 2
Protein accession: YP_003023284
Protein GI: 253702095
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
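
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the record itself: the function names are illustrative, and it assumes that the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4046018, 4047001)
print(length_bp)                  # 984
print(protein_length(length_bp))  # 327
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields listed above.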


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 5.6726e-25
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGAAAGT CAGAGAGGCA GGAGTCTGTA AAGTTATCCG TTCTGGTGCC AGTGTACAAT 
TGGGATGTCT CCTTATTAGT GCGCAAACTT ATTGAAGAAG CGGACTCCTC AGTACTGTGG
GCAAAGATTG AGATCATAGT TATCGATGAT CACTCGACAG ATCCGGTTAC GACAGATACT
AGCAAGATCT TAGAATGTGA GAACCAAAGG TCCGGATTCC ATTACTCCCG TCTGCCTCAA
AACGTCGGTA GATCAGCTGT ACGAAACTTG CTGGCAGCGA AGGCGAAAGG CGAATTCCTC
CTGTTTCTGG ATTGCGATGT TGCTCCGGAT TCTAAACATT TTCTTGCGTC GTATCTTGAG
TTTGCTGAGA AGGGCAGCCA CGATGTAATC TGTGGCGGTA GAAGCTATAA CTTACGAGTG
ATGACGGATG AAGAGTATGA CTACTACGTC TACTTTGGGA ATGTAAAAGA GGTTAAATCA
GCAGCCGAGA GAAATATTAT GCCATGGAGG CACCTTCTGA CTTCGAATGT GATGGTGCGT
AAGAAGGCTT TAGAAGAAAC CCCCTTCAAC GAAAACTTTG TGGGCTATGG GTATGAAGAC
ATTGAGTGGG GCGTCCGCCT GGCACAGGCA TACAGCATTC TGCACATCGA TAACACTGCC
TCTCATCTTG GTTTAGTCAC CAAACAAAAA GCCTACGAAA AAATGCGTGA GTCTGTGTCC
AACTACCTGC TCCTTAGGGA CCTTTACCCA CTCGCATTTA ATGTCTCTGC CATAAGCAAA
GTGGTACGCC TGCTGGAGTC TGTTCCTGCG CCGCTCCTGG GTGTGATGGA CCGGCTTCTG
AAAAACATGT TCCTGTCCAG CGGCAGCAAC CGCCTCGCTT TTCTCTTTTT TCAGCTCGAT
TTCGCGGTGC TTCTGGCGTG CACCCTAAGG GCGCGGCAGC GTGACCTGCT GGCCCCAAAG
CCGGCGCAAG GGGGGAAGCG TTGA
 
Protein sequence
MGKSERQESV KLSVLVPVYN WDVSLLVRKL IEEADSSVLW AKIEIIVIDD HSTDPVTTDT 
SKILECENQR SGFHYSRLPQ NVGRSAVRNL LAAKAKGEFL LFLDCDVAPD SKHFLASYLE
FAEKGSHDVI CGGRSYNLRV MTDEEYDYYV YFGNVKEVKS AAERNIMPWR HLLTSNVMVR
KKALEETPFN ENFVGYGYED IEWGVRLAQA YSILHIDNTA SHLGLVTKQK AYEKMRESVS
NYLLLRDLYP LAFNVSAISK VVRLLESVPA PLLGVMDRLL KNMFLSSGSN RLAFLFFQLD
FAVLLACTLR ARQRDLLAPK PAQGGKR
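
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be stripped before any analysis. The following is a minimal Python sketch of that cleanup, together with a GC-content calculation and a translation of the gene sequence. The helper names (`clean`, `gc_percent`, `translate`) are illustrative only. The codon table is built from the standard compact amino-acid string; translation table 11 assigns the same amino acids to codons as the standard table (the two differ in their permitted start codons), so the same string is used here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGGGAAAGT CAGAGAGGCA GGAGTCTGTA AAGTTATCCG TTCTGGTGCC AGTGTACAAT"
print(translate(first_line))  # MGKSERQESVKLSVLVPVYN
```

The printed residues match the first two blocks of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.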