Gene GM21_1711 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1711 
Symbol 
ID8137042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1992875 
End bp1993888 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content64% 
IMG OID644869323 
Productglycosyl transferase family 2 
Protein accessionYP_003021523 
Protein GI253700334 
COG category[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases 
TIGRFAM ID[TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value2.9788799999999997e-27 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGACG CCATCATAGA CGTCATCATA CCCATTTGGA ACAGGCCCGA CGAGACCCGG 
AACTGCCTGG TCACCCTGAT CAAACACACA CCCGGCGCCC GATTCATCAT GGTGGACTGC
GGGTCCGAGC GGGATACCGA GAGGCTCCTG CAGGAACTCG CCGACAGCCT GGACGACCGC
GCGTTGTTGA TGCGCGATGA CAGCAACATC GGTTTCGTGC CTGCTGCGAA CCGCGGTTTC
GAAAGCTCCG AGGCGCCGTA CCTTGCTTTG GTGCGCAACA CCAGCCTGGT GAGCCCCAAT
TGGCTGGAGC CGCTGCTCGC CTATGCCCAA GAGCATCCGG AGGCGGGGAT TCTCCTTCCC
TGTCTCGATC CGGGCGAGGA GTGCAGCGTT ACGACCGAAC TCGAACGGGG CTCTTTCGCC
GCCATGGTGA TCGCCAGGGA ACTCTATCGC CGGATCGGAG GTCTCGACGA GGGGATGGAC
GGCGGCGTGT GGTGCCTCAA GGACTACACC CGGCGCGCCA ACGCCCAGGG GTTCATCACC
GTGCAGGTGC CCACCCCGGT GGTGCGCCAC CAGGAAGAAG TCCGGCTCGG TTCCGAGCAA
AGGCGGCGCG AAACGCAGCA GCGGAGCATC GCGCTTTTCA GGGAACGTTG GGGCGTGGGA
GGGAGCTACA TCCTTCATGT ACCCAAGGGG ATCGAAGTCG AGCTGCTGGG CGAAAAACTG
CAGTGGCTGG TAAAAGGGGC GCGGCACGAC GACAGCTTTA CCGTGCTGCT GCCGGCCTCC
TTGAACCAGG CCGCCCAGCA GGCGGGACTC GGGCGCCTGC ACGAGCACGT CACCCTGGTA
CCGCTCCCAA GGCTCGCCTG GGACGGCATG AAGAAGCGCC TCTTCGACAA GATCGTGTCC
CAGAAACCGG GGACCACCCC GGTCACAGCG GTGGATGGAA TACCCTTCCC CTGGAGCGAG
CGGTACCTGT CCTTCTCCGA GCTTTGCGAG AGGATCAAGG CCCGCTACCA GTAG
 
Protein sequence
MTDAIIDVII PIWNRPDETR NCLVTLIKHT PGARFIMVDC GSERDTERLL QELADSLDDR 
ALLMRDDSNI GFVPAANRGF ESSEAPYLAL VRNTSLVSPN WLEPLLAYAQ EHPEAGILLP
CLDPGEECSV TTELERGSFA AMVIARELYR RIGGLDEGMD GGVWCLKDYT RRANAQGFIT
VQVPTPVVRH QEEVRLGSEQ RRRETQQRSI ALFRERWGVG GSYILHVPKG IEVELLGEKL
QWLVKGARHD DSFTVLLPAS LNQAAQQAGL GRLHEHVTLV PLPRLAWDGM KKRLFDKIVS
QKPGTTPVTA VDGIPFPWSE RYLSFSELCE RIKARYQ