Gene GM21_4112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4112 
Symbol 
ID8139486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4697050 
End bp4698207 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content65% 
IMG OID644871727 
Productglycosyl transferase group 1 
Protein accessionYP_003023885 
Protein GI253702696 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones85 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAGAG CGATCATCGT ATTCTCGCAC CTGCGCTGGA GCTTCGTGTA CCAGCGCCCA 
CAGCAGTTGT TGACCCGGAT GGCGGGCAGG CGCCGGGTGA TCTTCTTCGA AGAGCCGCTT
TACGACGCCG GGCGTGCGCC GTTTCTGGAA TGCAGCACGC CGGAGCCGGG GGTGCTGGTC
TGCAGGCCCC ATACCCCGTC CCAAAAATCG GGGTTTCACG ACGAGCAGCT GCACTGGCTG
GCGCCGCTTC TGGAAGAGCT GGTGGCGCAG GAGGAACTGA GCCGGTATAT CGTCTGGTTC
TACACCCCGA TGGCGCTTCC CCTGGCAAGG GTGCTCCGCC CCTCCCTGGT GGTGTACGAC
TGCATGGACG AGTTGACCGG TTTTCTGGAG GCGCCGAAGG AACTGGTGCA GCGGGAAAAG
GCGCTGCTGG CAGTGGCGGA CCTAGTTTTT ACCGGCGGGC CCAGCCTGTA CCAGGCCAAG
AAGAGTCATC ACCCCGAGGT GCACTGCTTC CCGAGCAGCG TCGACGCCTC CCATTTCGCG
CTCGCCTGCG ATCCGGAGTG CGAGCACGCG ACCCAGAAGG CCCTCCCCAA GCCGAGGCTC
GGCTACTTCG GCGTGCTGGA CGAGAGGCTC GACCTGCAGC TTCTGCACAC CTTGGCGCTA
TCCCATCCCG ACTGGCAGAT CGTCATGGTC GGCCCGGTGC TGAAGATCTC TCCAGAGCTC
CTCCCCAGGG AGCCGAACAT CCACTACTTC GGGCAGCAGG AATACGCCGC TCTCCCCGGT
TACCTGGCCG GGTGGGACGT CTGCCTCATC CCGTTCGCAT TGAACGACGC CACGCGCTTC
ATCAGCCCCA CCAAGACCCT GGAGTACATG GCCGCGGAGA AGCCGGTGGT CAGCACCCCC
ATCACCGACG TGGCGGTCCC CTACGGCGAC ATCGTCTTCA TCGGGGACGG CATCGGCAAC
TTCATAGCCG CCTGCAAGAA AGCCCTGGCG CTTTCGCCGA ACCGGTACCG GGAGATGGTA
GGCGCAATGC GCCAGGTGCT CGCGGGGACT TCGTGGGACG CTACGGTGCA GGGGATGAAC
CAACTGATCG ACCGGGCGGT CCGGCGCAAG AGGGCGCGGC CGGTGCGCAG CGAAACGGTG
GCTTCGGAGA ACGTTTGA
 
Protein sequence
MPRAIIVFSH LRWSFVYQRP QQLLTRMAGR RRVIFFEEPL YDAGRAPFLE CSTPEPGVLV 
CRPHTPSQKS GFHDEQLHWL APLLEELVAQ EELSRYIVWF YTPMALPLAR VLRPSLVVYD
CMDELTGFLE APKELVQREK ALLAVADLVF TGGPSLYQAK KSHHPEVHCF PSSVDASHFA
LACDPECEHA TQKALPKPRL GYFGVLDERL DLQLLHTLAL SHPDWQIVMV GPVLKISPEL
LPREPNIHYF GQQEYAALPG YLAGWDVCLI PFALNDATRF ISPTKTLEYM AAEKPVVSTP
ITDVAVPYGD IVFIGDGIGN FIAACKKALA LSPNRYREMV GAMRQVLAGT SWDATVQGMN
QLIDRAVRRK RARPVRSETV ASENV