Gene GM21_3409 details

Gene Information

Locus tag: GM21_3409
Symbol:
ID: 8138776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3940540
End bp: 3941616
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 65%
IMG OID: 644871026
Product: glycosyl transferase group 1
Protein accession: YP_003023191
Protein GI: 253702002
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
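
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3940540
end_bp = 3941616
gene_length_bp = 1077
protein_length_aa = 358

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("span is a whole number of codons:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under those assumptions all three checks agree with the values listed above.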


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 139
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCATCG CCATGATCGA CAACAGCCGT GGCTGGGGTG GAGCCGAGCA GGTTATGCTC 
ATGGTCGCTT CCCATCTGCG TGAGCGCGGC CACGAGGTGA CCGTGTTCGT GCGGGAGGGG
GGCGCCCTGG TGGAGCCTTT CCGCAGGGCG GGGCACGATG TCTGCGCTGT ACCGCGCAAA
GGGCTCGGGG TCTTGGCCGG GATTGCCAGG ACGGCAAGCG CCATCCGTGG CGGCGGGTTC
GACCTGATCC ACGTGCACCG AAACCACGAC CTGGTGGTCG GCAAGATCGC TTCCGTGGCG
GCCGGGCTCC CGATGCTCCT CACCCAGCAC TGCCTTTTGG GGAACACATC CAGCTCGATC
ATCAACCTGG CCGACCGCGT CGTCGCGGTC TCTGGTTTCA TCGGCGACGA CATGAAGTGC
CGCTTCCCGG TTCTTTCCGG CAAGCTGCAG GTGATCCACA ACGGCATCGA TCTCACCCCG
TTCAAGGAGC CGAAGCCGGG CTTCTGGGAG AAGGTCCCGG CGGTCGCGGG CGCTAAGCCG
CTCTTGGGGG TTATCGGCTA CTTCTACAAG AATCAGGAAG AGCTCATCGC CATGATGCCG
CGCGTGCGGG AACGGCTGCC GCAGGCGAAG CTCGTCATCA TCGGCAAGGA CGACGAGAAG
CAGCCCGCCC TCGAGAAGCT TGCGGCCGAG TTGGGTGTGG CGGATGCCGT CTACTTCCCG
GGGAAGATTC CGTACGCCGA GATCGGTGAC GCCATGGCGG GGCTCGATTT CAACGTGAGC
GCCTTCCGGC GCGAGGGGTG CGCCCTGAAC GTATTGGAAT CTCTCGCGGT CGGCACCCCC
TTCGTCGGCT ACCGCTCCGG CAGCTATCCC GAGCTTGCCA TCGACGGAGA AACCGGGTTG
CTGGTCGACA ACCAGGACCA GTTCGTCGAC GCGCTGGCGC GTCTTTCGGC CGATCCCGAG
CTCGTCGCCT CAATGAGGAA GAGAGCCCGG GAGGATGCCC TTGTCCGCTT CGACCTGAAT
CGGATGGTTG AGGACTACCT GGACCACTAC CGGGAGATGA CGGGGGGAAA GCCGTGA
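
A GC content figure like the one reported in the Gene Information section can be recomputed from a nucleotide sequence in this layout. The following is a minimal sketch, assuming the percentage is the share of G and C bases over the whole gene sequence; how the source database actually derives or rounds its value is not stated in the record. The function name `gc_fraction` is hypothetical.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping and line breaks used in this record)
    is ignored, and the comparison is case-insensitive.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Example input: the first line of the gene sequence above.
first_line = "ATGCGCATCG CCATGATCGA CAACAGCCGT GGCTGGGGTG GAGCCGAGCA GGTTATGCTC"
print(f"GC fraction of first line: {gc_fraction(first_line):.3f}")
```

Passing the full gene sequence as one string gives a value that can be compared with the reported GC content.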
 
Protein sequence
MRIAMIDNSR GWGGAEQVML MVASHLRERG HEVTVFVREG GALVEPFRRA GHDVCAVPRK 
GLGVLAGIAR TASAIRGGGF DLIHVHRNHD LVVGKIASVA AGLPMLLTQH CLLGNTSSSI
INLADRVVAV SGFIGDDMKC RFPVLSGKLQ VIHNGIDLTP FKEPKPGFWE KVPAVAGAKP
LLGVIGYFYK NQEELIAMMP RVRERLPQAK LVIIGKDDEK QPALEKLAAE LGVADAVYFP
GKIPYAEIGD AMAGLDFNVS AFRREGCALN VLESLAVGTP FVGYRSGSYP ELAIDGETGL
LVDNQDQFVD ALARLSADPE LVASMRKRAR EDALVRFDLN RMVEDYLDHY REMTGGKP
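
The protein sequence can be checked against the gene sequence by translating the CDS codon by codon. The sketch below is an illustration, not the database's own method. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code; table 11 differs in its set of permitted start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, with the last base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon.

    Whitespace in the input is ignored; any trailing partial codon is dropped.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCGCATCG CCATGATCGA CAACAGCCGT"))  # prints MRIAMIDNSR
```

The printed residues match the first ten residues of the protein sequence listed above. Translating the complete gene sequence the same way allows a full comparison.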