Gene GM21_3509 details


Gene Information

Locus tag: GM21_3509
Symbol:
ID: 8138881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4050429
End bp: 4051763
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 61%
IMG OID: 644871128
Product: polysaccharide biosynthesis protein
Protein accession: YP_003023288
Protein GI: 253702099
COG category: [R] General function prediction only
COG ID: [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid
TIGRFAM ID:
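
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database tooling.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(4050429, 4051763)
print(length)                     # 1335
print(protein_length_aa(length))  # 444
```

Both values agree with the Gene Length and Protein Length fields of this record.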


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 1.00438e-16
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACCAGG CGTGGGTTAG CTATCTGCCG GGATTCCTGC GCAAGCGGGT CGAGGGGCGG 
CACGAGCTGC AGAAGGTGCT GAACAACACC GGCTGGCTTC TCGGCGACCG CGTCCTCCGG
ATGGGGGTGG GTCTGCTGGT TGGCATCTGG ATAGCCCGCT ATCTTGGCCC CTCCAACTAC
GGCATGCTGA GCTACGCGGC GTCCCTGGTC GGCATCTTCA CCTCAGTGGC CATCCTTGGG
CTGGAAGGGA TCATTGTCCG GGACCTGGTG CGCTTCCCCG ACCGGGAAGG GGAGATCCTC
GGCACCACTT TTTCGCTGAG ACTCACGGCC GGGATCTGTT CCTATCTCCT CACCGTCGCC
ACCGTGTTCA TCCTCCGTCC TGGCGACGCC GTTTCCCAGA TGATGGTCGC GGTGATGGGG
TGGGTCCTGA TCTTCAACTC CGCCGATACC ATGGATTTGT GGTTTCAGTC CAAGGTACGG
TCGAAGTACG TGGTCTATGC CAAAAACGGC GCGTTCCTGC TGAGTTCCGC GCTGAGGCTG
GCCCTGGTGC TGATGGAGGC CCCGGTAGTT GCCTTCGCCG CCGCCAATGC GATGGAGGCG
GCGCTCGGAG CCGCAGGGCT CTTCTACGTC TACCATCGCG ACGGGCAGAT GGTCAGGCGC
TGGAAGGCGA GCCTCGCCCT GGCGCGCGAG CTGCTGAAAG ATTCCTGGCC CCTGGTCCTG
TCGGGCGTGG TGTACATGAT CTCGCTAAGG ATAGACCAGG TCATGCTGGG GCAGATGGCC
GATACCCACG AGGTCGGCAT CTACGCCTCG GCAGTCAAGA TCGCGGAGAT CTGGTTCTTC
ATACCAACCG CGCTCGTCAC CTCCGTCTTT CCCAACATCG TGAAGGCGAA GGAATCAAGC
GAGGAGGAGT TTCACGGCCG GCTGCAAAAG CTCTACAACC TGCTTGCCTT CACCGGGTAC
GCCATCGCCA TACCGACGAC GCTACTGGCT GGTTTCGTCG TTCACCTGCT TTACGGCGAT
GCCTATGTAG CCGCCGCACC GATGCTCATC TTCCTGATTT GGAGCGACCT GTTCATCAAC
ATCGGCGTGG CGCGGAACTC CTACCTGCTC GCCATGGGGT GGTCCTGGTG CTACTTCTGG
ATGGCGGTCT CGGGGATGGT GATAAACGTG GCCCTCAACC TCTTCCTGAT ACCGCGCTAT
GGTGGAACGG GAGCGGCGAT AGCGACCTGC ATTTCCTACT GGGTCGCGGC CCACGGCGCC
AGCTATTTCT ACAGGCCGTT ACGGAAGTCG GCGGGCATGA TCACCAGGGC GCTCCTCTGC
CCGAGGTTTT GGTAA
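
The gene sequence is displayed in blocks of 10 bases, so the whitespace has to be removed before any analysis. As a minimal sketch, the helpers below (hypothetical names) join the blocks and compute the G+C fraction. How the GC content field of this record was calculated or rounded is not stated, so treat this only as one plausible way to reproduce it.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed row of the gene sequence: six blocks of 10 bases.
first_row = "ATGAACCAGG CGTGGGTTAG CTATCTGCCG GGATTCCTGC GCAAGCGGGT CGAGGGGCGG"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full 1335 bp sequence to `gc_fraction` gives the value to compare with the GC content field.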
 
Protein sequence
MNQAWVSYLP GFLRKRVEGR HELQKVLNNT GWLLGDRVLR MGVGLLVGIW IARYLGPSNY 
GMLSYAASLV GIFTSVAILG LEGIIVRDLV RFPDREGEIL GTTFSLRLTA GICSYLLTVA
TVFILRPGDA VSQMMVAVMG WVLIFNSADT MDLWFQSKVR SKYVVYAKNG AFLLSSALRL
ALVLMEAPVV AFAAANAMEA ALGAAGLFYV YHRDGQMVRR WKASLALARE LLKDSWPLVL
SGVVYMISLR IDQVMLGQMA DTHEVGIYAS AVKIAEIWFF IPTALVTSVF PNIVKAKESS
EEEFHGRLQK LYNLLAFTGY AIAIPTTLLA GFVVHLLYGD AYVAAAPMLI FLIWSDLFIN
IGVARNSYLL AMGWSWCYFW MAVSGMVINV ALNLFLIPRY GGTGAAIATC ISYWVAAHGA
SYFYRPLRKS AGMITRALLC PRFW
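
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds a codon table from the compact TCAG-ordered amino acid string; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons. The function name is hypothetical, and the code assumes an unspliced, in-frame sequence starting at the first base.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map every codon (TTT, TTC, TTA, ...) to its one-letter amino acid code.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked: str) -> str:
    """Translate a (possibly space-blocked) DNA sequence up to the first stop codon."""
    seq = "".join(blocked.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAACCAGG CGTGGGTTAG C"))  # MNQAWVS
print(translate("CCGAGGTTTT GGTAA"))         # PRFW
```

The two fragments are the first 21 and last 15 bases of the gene sequence; their translations match the first seven and last four residues of the protein sequence.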