Gene GM21_3516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3516 
Symbol 
ID8138888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4057434 
End bp4058378 
Gene Length945 bp 
Protein Length314 aa 
Translation table11 
GC content60% 
IMG OID644871135 
Productglycosyl transferase family 2 
Protein accessionYP_003023295 
Protein GI253702106 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.00000921398 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGATCTCA GCATCGTAGT CCCCATTTAC AACGAGGAAG ACAACATTCC CATCCTGCAC 
GACCGGGTCA GCGAGGCGTT GGGCGACACC CTGCTCGAGT ACGAGCTGAT CCTCGTCGAC
GACGGCTCTT CGGACAACTC CTATTCCGGG CTGAAGCGCC TGGCGGCGAA AGACGACCGG
GTCAAGGTGA TACGTCTGCG CCGCAATTTC GGCCAGACCG CCGCCATGTC CGCCGGCTTC
GACTTAGCCT CAGGCCGGGT GGTGATTCCC ATGGACGGGG ACCTGCAGAA CGATCCGCTC
GACATCCCGC TGCTTTTGGC GCGGATCGAC GAGGGGTACG ACGTGGTATC CGGGTGGCGC
AAGGACCGCA AAGACACATT CGTGAACCGC AAGCTCCCTT CCATGCTTGC CAACGGCATC
ATCTCAAGGA TGACCGGCGT ACATCTGCAC GACTACGGCT GCACCCTGAA GGCCTACCGT
CGCGACGTGC TGGACGACGT GAACCTTTAC GGGGAGATGC ACCGCTTCGT TCCCGCGCTG
GCGCACCAGG TCGGCGCCCG GGTAACCGAA ATGCCGGTGC GTCACCACGA AAGGCTGCAC
GGCAATAGCA AGTACGGCAT CTCCCGCACC ATGAAGGTCA TCCTCGACCT GATGACGGTT
AAATTCCTAT TGAGCTACTC GACCAAGCCG ATCCAGCTCT TCGGCCGCTG GGGGATCTAC
ACCCTCGCCG CCGGGTTCCT AAGCGGCGCG GTCACCGTCT ACATGAAGTT CTTCGAAGGC
ATGAGCATGA ACCGCAACCC GCTCCTCATC CTGACCGCTT TCCTCCTTTT CATGGGGGTT
CAGTTCATCG TCCTCGGGCT TTTGGCCGAG CTCTCCGCCA GGACCTATTA CGAGGCGCAG
GGAAAGCCGA TTTACAACAT AAAGGAAAAG CTCAACTTTG GCTGA
 
Protein sequence
MDLSIVVPIY NEEDNIPILH DRVSEALGDT LLEYELILVD DGSSDNSYSG LKRLAAKDDR 
VKVIRLRRNF GQTAAMSAGF DLASGRVVIP MDGDLQNDPL DIPLLLARID EGYDVVSGWR
KDRKDTFVNR KLPSMLANGI ISRMTGVHLH DYGCTLKAYR RDVLDDVNLY GEMHRFVPAL
AHQVGARVTE MPVRHHERLH GNSKYGISRT MKVILDLMTV KFLLSYSTKP IQLFGRWGIY
TLAAGFLSGA VTVYMKFFEG MSMNRNPLLI LTAFLLFMGV QFIVLGLLAE LSARTYYEAQ
GKPIYNIKEK LNFG