Gene GM21_3816 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3816 
Symbol 
ID8139190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4388315 
End bp4389454 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content65% 
IMG OID644871435 
Productglycosyl transferase group 1 
Protein accessionYP_003023593 
Protein GI253702404 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones101 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCTAAAA CCTTACTGAT CGAGGGTTGG CGTTTCGTCC CCCACTCCTA CGCCTGCGTA 
AACATGTGGC AGTGCCTCGA ACTGATCAAG CGTCCCGACG TAGAGCTGTA CCATCGCGAC
CTTCCCTATT TCAAGCCCGA ATGGAAGCCG ACGCAGCACC TCTTCGACCC CGCCGCGACC
GCCGCACTAA AGGCCATCCC CCCCTTGCCG CCGGGAAAAA AGGCCGACCT TTTGCTCAGG
GTCGCTTTCC CCTACCGCTT CGACCGGAAC AACGCCGGCC GGCACCTCGT CTTCGGCACC
GCCGAGCACG GCATCGTCAC CCCCTCCATG GTGGAGGGCG GAGTCCCGCT GGCGCAGGCC
ATGGCGGACT CGGAAGCTTT GATAATCACC CCGAGCAACT GGTCCCGGGC CGGCTTCCTT
AGAAGCGGCG TCGCGCCGGA ACGGGTGGCC TTAGTGCCGC ACGGAGTGGA CCCGGGGATC
TTCCGGCCGC TGCCGGAAGC GGAGCGGGAA GCGCTCAGAC GACAGTTGGG GTGGCAGGAT
AATTTCGTCG TCCTGAACGT CGGCTGCATG ACCGGCAACA AGGGGGTGCG CTACCTGCTG
AAGGCATGTG CCGTGCTGCA AGAGCGCTTC CCGCAACTGA AGCTCTGCAT GAAGGGGCTC
GACCCGCTCT ACCCTTCGCG CCGGCTGCTG CAGGAGGCGG GAGACCTTTT GACCGCGGAA
GAGGGAACCC GCCTCGCCTC CTCGCTGGTC TACATCGGAG AGGACCTCTC CTTCTCCGAC
ATGGTCAGCC TCTACAACGC CGCCGACGCC TACGTCTCCC CCTACATCGC CGAGGGTTTC
AACCTGCCGG TCCTCGAAGC CGCCGCCTGC GGGCTCCCGG TCATCTGCAC GGCGGGTGGG
CCGACCGACG ACTTCGTCGA TGCGAGCTTC GCCAAAAGGA TAGACAGCAC GCTCATCCAG
AAAGACGGAT TGTTAGGCGT GCAGCCGGAC CTGGAGCACC TCGTCGAACT TATCGCGCAA
ACGGTCCAGG ACCACGAGTT CCGCCAGAAG GCCCGTGGAG CCGGCCCCTC CTTCGTGGCC
GGCTCCTTCA CCTGGCGCCA CGCGGTGGAG AAGCTGCTGA CGCTGCCGCA GTCGGACTGA
 
Protein sequence
MPKTLLIEGW RFVPHSYACV NMWQCLELIK RPDVELYHRD LPYFKPEWKP TQHLFDPAAT 
AALKAIPPLP PGKKADLLLR VAFPYRFDRN NAGRHLVFGT AEHGIVTPSM VEGGVPLAQA
MADSEALIIT PSNWSRAGFL RSGVAPERVA LVPHGVDPGI FRPLPEAERE ALRRQLGWQD
NFVVLNVGCM TGNKGVRYLL KACAVLQERF PQLKLCMKGL DPLYPSRRLL QEAGDLLTAE
EGTRLASSLV YIGEDLSFSD MVSLYNAADA YVSPYIAEGF NLPVLEAAAC GLPVICTAGG
PTDDFVDASF AKRIDSTLIQ KDGLLGVQPD LEHLVELIAQ TVQDHEFRQK ARGAGPSFVA
GSFTWRHAVE KLLTLPQSD