Gene GM21_3506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3506 
Symbol 
ID8138878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4047037 
End bp4047993 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content51% 
IMG OID644871125 
Productglycosyl transferase family 2 
Protein accessionYP_003023285 
Protein GI253702096 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value6.33454e-25 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAGTTCTT TCAGTCCCCT TGTTTCCATC ATCATTCCCG TCTACAACGG CTCCGACTAT 
CTGCGCGAGG CGATTGACAG CGCCTTGGCG CAAAGCTACG GCAATATCGA GATCCTCGTC
GTCAATGATG GGTCGCGTGA CGAGGGAAAG ACAGAGGCGA TAGCCATCTC CTATGGGGAC
AAGATCCGTT ATATTCCCAA GCAAAACGGC GGCGTCGCCT CCGCGCTCAA TCTCGGCATC
CGGGAAATGA CTGGGGAATA TTTCACTTGG CTTAGCCATG ACGACCTCTA CTATCCCGCC
AAGACGGAAG AACAGGTGCG GCTTTTGGAA GAGGCCGGCG GCGATGCTGT CGTCTACAGC
GACTACGAAT ATATAGACCC GAGTGGCAAC TATCTAGGAA CCAAGATGGC GAAGTCCAGC
GGGGTCAAAT ACTCACTGAT CATGGAAGGG ACCATCAACG GGTGCACGGT GATGATCCCG
CGACGATACT TCGACGAGAT AGGTCTTTTT GATCTGGCAC TCAGGACGAC GCAGGACTTT
GACATGTGGT TCAGGCTGGC GGAGCATTAC CCGTTCCTGC ACCAGGCCAA GGTGTTGGTC
AAGTCGCGAG TGCATCCAAA CCAGGGATCA TTGACAATCC CGACCCACAT AGAAGAGCAA
AATGACTTGT ACGTCAGGAC TCTGGGCACC TTCGAAGAAA AGGATCTGCC TGACACCGAT
ACGATGGCTT CCTTTTATTT GAAAGCTGCT ATCCGGTTCG TCTCGATGGG GTGCAATCAA
GCTGCGGACC ATTCTGCTTC TATGTTTTGG AGTGCATTTC AGAGAGACTG TATTGGTAAG
CGCTTGTCTC ATCTTGCCCT ATTCGCAGTT TACCGGCTGC TGCGGACTGC TTACAGAAGC
ACACGCGTGA AAAAAATAAT AAAAAGAGCA AACAAGATAC TGACTTCTTT CAATTAA
 
Protein sequence
MSSFSPLVSI IIPVYNGSDY LREAIDSALA QSYGNIEILV VNDGSRDEGK TEAIAISYGD 
KIRYIPKQNG GVASALNLGI REMTGEYFTW LSHDDLYYPA KTEEQVRLLE EAGGDAVVYS
DYEYIDPSGN YLGTKMAKSS GVKYSLIMEG TINGCTVMIP RRYFDEIGLF DLALRTTQDF
DMWFRLAEHY PFLHQAKVLV KSRVHPNQGS LTIPTHIEEQ NDLYVRTLGT FEEKDLPDTD
TMASFYLKAA IRFVSMGCNQ AADHSASMFW SAFQRDCIGK RLSHLALFAV YRLLRTAYRS
TRVKKIIKRA NKILTSFN