Gene GM21_3184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_3184 
Symbol 
ID8138536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3696282 
End bp3697610 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content59% 
IMG OID644870789 
ProductO-antigen polymerase 
Protein accessionYP_003022969 
Protein GI253701780 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones111 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAGGTA TGCTTGCCTT GCTTGCCTGT CTGACTTTAG TTGCCTGGCT TTTCATCCGG 
GACAACAGGG TTCGCCCGAT GCCGTCCCCA GAGTCGTGGC TCACGCTTGC CTGGTTCTTT
ATCGTGGGCA CCAGGCCGCT CTCGGCATGG TTTTCAATCC CGGAAGAGGA TTCCTCCGAC
GCCTTCCTGG AGGGAAGCCC GCTGGACCGG TACTCCCTGC TGGTGCTGAT CCTGTGCGGC
TGCCTCGTCC TGTTGCACAA GCGGCCGCAG TGGCGGAATG TGCTTCGTTC CAACCTCTGG
TTCACCGGCT TCATCGCCTA CTGCGCTATC AGCGTCATCT GGTCCGACTA CCCTTTTGTC
AGCTTCAAGA GATGGGTGCG TGAGTTTGGC AACCTGGTGA TGGTCGTACT CATCCTCACT
CAGGACGACC CTGCCAAGAC CTGCCGGGCG CTGCTGGCAA GGTTCGCTTA CCTGGTGATT
CCGCTTTCCG CAGTTCTCAT AAGTTATTTT CCCTCCCTTG GCACCTACTA CAGCAGCGAC
CTTGCCGGCA TCGCCTATTG CGGGGTCGCC ATCCACAAGA ACATGCTAGG CAGCATCATG
TTCATCTCGG CAGTTTACCT TGCCTGGGAA CTCATCTACG TGCCGGACGC CAGGGAGACC
AAGGCATGGG ACCTGACCCT GCTTGCCGCG CTCTGGTTGA TGGCGGTATG GCTCATGCTG
GTGGCAAGCA GCTCCACAGC TCTCATCTGT CTGGCACTGG GGTCAGCGAT GCTGCTTATG
TTGAAGCTTT CCTTTGCCAG AAAGCAGGTC CGGCACCTCG GTGTGTACAG TCTGCTCGGG
GCCAGCCTGC TTGTGACGCT ATTCTCACTT CAAGGAGCGG TGGAGATGAT CACGGGGGCG
GTGGGGCGGG ACCTGACCTT CACCGGGCGC ACCGAGCTCT GGGCTGACGT CCTCAGGGAG
CCCATAAACC CGCTAGTTGG CACCGGATAC CAGAGCTTCT GGCTTGGAGC CCGTGCCGAT
GATCTGTGGG AGCGCTACCT TTTCCATCCG CGGCAATCCC ACAACGGTTA TCTGGAAACC
TACCTGAACG GCGGGCTGCT TGGTCTGTTC CTGCTGCTCG CGGTGATCGC GTCCATCGGG
AAACGCCTGA AAGGGGGGCT TCTATCCGGT AACAATTTCG CTGTCCTGCT CTTTTCCTTC
TGGGTGGCGG GGCTTTTTTA CAATTTCACC GAAGCACGCT TCGTAGGTCC CAATCTGATT
TGGATCATGC TGAGTCTCGC CGCGTTGTAC CAGCCAGAGA AGGGAGAGTC GCTGCAAACG
GCGGGGTAG
 
Protein sequence
MLGMLALLAC LTLVAWLFIR DNRVRPMPSP ESWLTLAWFF IVGTRPLSAW FSIPEEDSSD 
AFLEGSPLDR YSLLVLILCG CLVLLHKRPQ WRNVLRSNLW FTGFIAYCAI SVIWSDYPFV
SFKRWVREFG NLVMVVLILT QDDPAKTCRA LLARFAYLVI PLSAVLISYF PSLGTYYSSD
LAGIAYCGVA IHKNMLGSIM FISAVYLAWE LIYVPDARET KAWDLTLLAA LWLMAVWLML
VASSSTALIC LALGSAMLLM LKLSFARKQV RHLGVYSLLG ASLLVTLFSL QGAVEMITGA
VGRDLTFTGR TELWADVLRE PINPLVGTGY QSFWLGARAD DLWERYLFHP RQSHNGYLET
YLNGGLLGLF LLLAVIASIG KRLKGGLLSG NNFAVLLFSF WVAGLFYNFT EARFVGPNLI
WIMLSLAALY QPEKGESLQT AG