Gene GM21_2459 details

Gene Information

Locus tag: GM21_2459
Symbol:
ID: 8137800
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 2874007
End bp: 2875320
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 58%
IMG OID: 644870069
Product: O-antigen polymerase
Protein accession: YP_003022260
Protein GI: 253701071
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID: [TIGR03097] probable O-glycosylation ligase, exosortase system type 1-associated
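
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2874007
END_BP = 2875320


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1314
print(protein_length(length_bp))  # 437
```

Both results agree with the Gene Length and Protein Length fields in the record.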


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a

Fosmid Coverage information

Num covering fosmid clones: 152
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTTAAGGG GAGCCTTGTT GTACGTCCTG TTGCCGGTAC TCGTTTTCTG GATACTGAGC 
AACACCTACG TTGGGCTCGT GCTTTACACC TGCGCCAACA TCATCCGTCC GGAGATGTTT
TTTTGGGGAG GCAACCAGGG GGCGATCGTC TTCAAGGTTT TCATAGGCGC TGCCATGATC
TCCTTTCTTC GCAATCCCGG TGGCAGGTTG ACCAGCGCCT TCGCCTGCCG CGACTTCTGG
CTCATCATCT GGATCTGGGT GGCGCTTTTG GTCTCAATCT GGTTTTCGCC CTACCCGACC
AACCCCTTGG CCTACGAGTT CGCCAACGAG TTTCTCAAGC TGGCCGTGCT CCTGTGGCTC
GTACTGGGGC TTCTTCACAG CGAAGAGCGT ATCGTGCACT ACGAGCGCAC CATGCTGGGT
GCCTTTAGTT TCCTTGCCCT CTGGGGGGTC GAGCAGCATT TCCGGGGCAA CGAGCGCCTT
GAGGGGCTCG GGGGGGCGGC TTTTGGCGAC TCGAACGGTG TGGCGGCTAT CCTGGTACTT
GCTTTTCCGC TGGCCTTGAA TCTTGCCCTG CACAGTTCCG AAAAACGTGA ACGCTGGCTT
GGATGGGGGG CCTCCGCGGC AATCCTTTTG GCCGTAATCG CCACTCAGTC CCGGGGAGGG
TTCCTGGGCT TGAGCGTAGC TGCGGCGGTG TTCTTCCTGC ACTCCCGAAA GAAAAAGCAG
TTGATCCTGA TCACCTCCGT CCTGCTGGGA GTGGCAGCGC TGTTTGTCAC TGACGGTTAC
CTGAACCGGA TGTCGACCAT CACCGCCGAG GAGGAAGAGC GGGACCTTTC CTCCGGTTCC
CGATTGGTGC TTTGGAAAGC CGGAATGATG GTATTCCGCG ACAACCCGAT CATCGGCACC
GGGTTTCTCT CCTTTCCGAT CGCGAAGCAG GAATACAAGG ACGCCGTGCC CAACGTCGAC
GAGAAACTGC GTGAATACTC CTTCCGCGGC TACAAAGTGA GCCATAACTC GTACGTCCAG
GCATTGTCGG AGGGGGGGCT ATTCCTCTTC GTTCCCTATG CGCTGCTGAT ACTGGGCTGC
CTTTGGGGTA ACAGGGCACT GCGCCGGGCG AAGCGGGACC AGGCGGAGCA CCGGTTGATG
CTCCTTCTGA ACGGCATCGA GGCGGGAATC GTCGGCTATT GCGTCTGCAT CGTCTTTATC
AACGCCCTCA CCGTAGTACT TCTTCCCGTT CAGATAGTCG TTTCGCGGGT CATCCGAGAC
ACCCTGCAGC GTGAGGAAGA GTCCGCTCTA CCTGCTGTTG AAGCCGCCTT GTAA
 
Protein sequence
MLRGALLYVL LPVLVFWILS NTYVGLVLYT CANIIRPEMF FWGGNQGAIV FKVFIGAAMI 
SFLRNPGGRL TSAFACRDFW LIIWIWVALL VSIWFSPYPT NPLAYEFANE FLKLAVLLWL
VLGLLHSEER IVHYERTMLG AFSFLALWGV EQHFRGNERL EGLGGAAFGD SNGVAAILVL
AFPLALNLAL HSSEKRERWL GWGASAAILL AVIATQSRGG FLGLSVAAAV FFLHSRKKKQ
LILITSVLLG VAALFVTDGY LNRMSTITAE EEERDLSSGS RLVLWKAGMM VFRDNPIIGT
GFLSFPIAKQ EYKDAVPNVD EKLREYSFRG YKVSHNSYVQ ALSEGGLFLF VPYALLILGC
LWGNRALRRA KRDQAEHRLM LLLNGIEAGI VGYCVCIVFI NALTVVLLPV QIVVSRVIRD
TLQREEESAL PAVEAAL
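
The gene and protein sequences above are printed in blocks of ten characters, so the spacing has to be removed before any computation. The sketch below shows one way to do that, compute a GC fraction, and translate the DNA. It assumes the standard codon assignments, which translation table 11 shares with table 1 for elongation (table 11 differs in the start codons it permits, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "ATGTTAAGGG GAGCCTTGTT GTACGTCCTG TTGCCGGTAC TCGTTTTCTG GATACTGAGC"
print(translate(first_row))  # MLRGALLYVLLPVLVFWILS
```

The translated first row matches the first 20 residues of the protein sequence. Pasting the full gene sequence into `translate` and `gc_fraction` would allow the same check against the complete protein and the reported GC content.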