Gene Moth_1253 details


Gene Information

Locus tag: Moth_1253
Symbol:
ID: 3833048
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 1294752
End bp: 1297250
Gene Length: 2499 bp
Protein Length: 832 aa
Translation table: 11
GC content: 60%
IMG OID: 637829189
Product: O-antigen polymerase
Protein accession: YP_430110
Protein GI: 83590101
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID:
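
The start, end, gene length, and protein length fields listed here can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    # Values copied from the record for Moth_1253.
    length = gene_length(1294752, 1297250)
    print(length, protein_length(length))
```

Under those assumptions the computed values agree with the 2499 bp and 832 aa reported in the record.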


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0517425
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCACATA AAAAGAAAAT TAAAGACAAC GGCAGGGGAA AGGTTATTTC CCTGGAGAGT 
GCCGTCAATA AAGGGCGAAA AACCAAGACG GATAATAAGG CGCACGGGGA AAGTGACGCT
GGTAGCCAGG TGTCCCCCTG GCGCCGCCGG ACGGAAGATC AGGTCGCTGA AGAGACCTTC
TGGCCGGATA AGGGTAATCG AACTTTATTT GCCCTGGCCT TTAGCGGGCT AATTTTGCTT
CTTTTCTACC CGCCCTTCTT TCGCGGCCTG TTCTTCCCGG TGGAACAGCG CTGGACGCTG
ATACTGGCTG CCGTCATCTT TTTCTTCGCT TATCTGTGGA AGTTTTCCCG CCGGGAGATC
GCCTTTCTCA ACCGGCCCCT TGATTTCGCG GCGGCTGCCC TGGTAGTGGT CTATATCCTG
GCGGCCATAA AACCGGCTAG CCGGGGCCTG GCCATGGCCG AGATCGCGAA GGTGCTGCTC
TATTTTTTGA CCTTCTGGCT TGTTTCGCGC CTGGGAGGGC AGCGCCGAAC CCTTTACCTG
CTGCATGCCC TTTACCTGGC CGGTGTTGGT GCGGCTCTGG CCGCCCTCCT CGCGGCTACC
GATCTGGTTT ACATTAAAGA CGGGTTTGTT GGCGGTCGTT TTTTCTCCAC CCTGCAATAT
CCCAACGCCC TGGCCAGTTA TGTCGGCGCC GCCAGCATCA TCGGTTTTTA TCTCTGGGCA
TGGTCAGGAA ACCGGTGGCG CTATGGTTAC GCTGCCGCCA ACTACCTCTT GCTCATGGTC
TTCCTGGGGA CTGGCTCCCG TGGGGCCTAC CTGATTTTCC CGGCGGTGGT TTTCCTGTAC
TGGTTGCTGG CGCCGGGCGG CTACCGGCTG AATACCCTGG CCCACCTGGT GGCCTGCGGC
GCCGCCGCCC TGTTGGGCGT TGCCCGTTTT ATCCCCCTGG CCTTAGCCAA GGCCCACGGG
CAGGCCTGGG GCTGGTTTAC CCTGGGGCTG ATGGTGGCCC TAGTTGGCCA GTTGCTTATC
CAGGGGGCCG GAAGGGTTTT AACGACCCCC CGGGCCAGGA TGGCAGCCGG TTTGGTTATA
CTGGTCATCC TGGTAGGAGG CGCTGCCGTC TTTGTCCTGC ACCAGCCGGG AATTAGCGCG
GCATCTACCG GGGGACAGGT CCCCGGCGTC CTGGGCAAAA TCCTGCCGCC CCAGGTTGTA
TCTCGCCTCA AGGATATTAA CCTTAAGACC AGGAGCAGCC GGGAGAGGAT TATCTGGACC
CAGGATGCCC TGCAGATGGT GCGGCAGCGC CCCATCCTGG GCTTTGGCGG CGGCGGCTGG
GAAGCAGCCT ATCGCCAGTA CCAGCGGTAC TACTACAACT CCACCCAGGT GCATAACGAT
TATGCCCAGG TGGCCGTAGA AGCAGGTCTG GTCGGTCTGG TCGTCCTGGC GGCGGTGTGG
TTGTTGTTCC TCCTGGTCAC GGCCGGCAAC TACCGACATA GCCAGGGTCA AGGTCGCCTC
CAGGCCCTGG CCGTCGGCGC AGCTGCCGTC AACCTGGGAC TTCACGCGGC CATTGACTTT
GACCTGGCCC TGGGGGCGGT GTCGATAATG CTCTGGGCCT GTTTCGGCCT GGCCCGTAGC
CTTGAAGGCC AGCGTTTGGA GCCGGAGCCG GCCCTCCCAC CGTTAATTTT TAAGAACCGG
CAATTGCCCT GTATAACAGC CGTCTCCCTG GCAACCCTGG TGCTGATTCT ATTTGCCGGC
TCGTACCTGG CCGGGGTTTC CAGTTACCGC CAGGCTACTG CCGCCCTGCA GCAAAACAAC
CTCCCAGCTG CGGCAGCTTA CCTGGAAGAG GCCAGCCGTT ACGATCCCTT TACAGCTTCT
TATAACAGCG ATCTGGCAGG TATTTATCTA CAAGAAGGAA AAACTAAGGA GGCCCTGAAC
CAGGCCCTGG CTGCCAGCGC CAAAGAACCT TATAATCTGG CAATCTTAAA CCGCCTGGCT
GAAGTCTACT GGCAGGAGGG GTCGGCTCAG GAGGCCGTGG CCACCATGGA AAGGGCCCGG
CAACTGGCCC CCTGGGTCGG CGCTGTCTGG GAAAACCTGG GTCAGGTTTA CGATGCCGCC
GGCATCAGTT ACCTTCAGGC CGGACAGAAG GATAGGGCCC GGCAAATGTT CCAGGAAGCG
GCGGCCCTGC CGGAGTCTAT CCAAGCCAAG GTGGATACCC TGGGAGATTT TAAAGACCTC
CATCAACCCG GCGGAGTAGC CCTCAGTCCC GCCATTCAGC TCCGCGCAGG GATCGCCCAG
TACTTCCTGG GTCAGGAAAA AGAAGCCGCT ATTAATCTTG AGGCGGCAGC AAGGGACGCC
AACCTGCAGG CCGAGGCCCG GCTATGGCAG GCCGTCATGG CCTTTCACCA TGGTGACGGC
CTCCTGTCTA GCCGGTTGCT GGCCGAAGTC CAGAAGACCA ACACCAGCCT GGCTAAGGAA
TACGACCAGT TAAAAACTCT GCCTGTTCTC TCTAAATAG
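
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be compared with the Gene Information fields. The following is a minimal sketch; the helper names are hypothetical, and the GC fraction is computed over A, C, G, and T only.

```python
def clean_sequence(block_text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the unambiguous bases (A, C, G, T)."""
    counted = [base for base in seq if base in "ACGT"]
    if not counted:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for base in counted if base in "GC")
    return gc / len(counted)


if __name__ == "__main__":
    # Paste the full block-formatted gene sequence here; only the first
    # line of the record is used in this illustration.
    raw = "ATGGCACATA AAAAGAAAAT TAAAGACAAC GGCAGGGGAA AGGTTATTTC CCTGGAGAGT"
    seq = clean_sequence(raw)
    print(len(seq), round(gc_fraction(seq), 3))
```

Running it on the complete sequence gives values to compare with the reported gene length and GC content.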
 
Protein sequence
MAHKKKIKDN GRGKVISLES AVNKGRKTKT DNKAHGESDA GSQVSPWRRR TEDQVAEETF 
WPDKGNRTLF ALAFSGLILL LFYPPFFRGL FFPVEQRWTL ILAAVIFFFA YLWKFSRREI
AFLNRPLDFA AAALVVVYIL AAIKPASRGL AMAEIAKVLL YFLTFWLVSR LGGQRRTLYL
LHALYLAGVG AALAALLAAT DLVYIKDGFV GGRFFSTLQY PNALASYVGA ASIIGFYLWA
WSGNRWRYGY AAANYLLLMV FLGTGSRGAY LIFPAVVFLY WLLAPGGYRL NTLAHLVACG
AAALLGVARF IPLALAKAHG QAWGWFTLGL MVALVGQLLI QGAGRVLTTP RARMAAGLVI
LVILVGGAAV FVLHQPGISA ASTGGQVPGV LGKILPPQVV SRLKDINLKT RSSRERIIWT
QDALQMVRQR PILGFGGGGW EAAYRQYQRY YYNSTQVHND YAQVAVEAGL VGLVVLAAVW
LLFLLVTAGN YRHSQGQGRL QALAVGAAAV NLGLHAAIDF DLALGAVSIM LWACFGLARS
LEGQRLEPEP ALPPLIFKNR QLPCITAVSL ATLVLILFAG SYLAGVSSYR QATAALQQNN
LPAAAAYLEE ASRYDPFTAS YNSDLAGIYL QEGKTKEALN QALAASAKEP YNLAILNRLA
EVYWQEGSAQ EAVATMERAR QLAPWVGAVW ENLGQVYDAA GISYLQAGQK DRARQMFQEA
AALPESIQAK VDTLGDFKDL HQPGGVALSP AIQLRAGIAQ YFLGQEKEAA INLEAAARDA
NLQAEARLWQ AVMAFHHGDG LLSSRLLAEV QKTNTSLAKE YDQLKTLPVL SK
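
The protein sequence can be compared against a translation of the gene sequence using translation table 11, as listed in the Gene Information fields. The following is a minimal sketch that builds the codon table by hand instead of relying on a library; table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. The function name is hypothetical, and the sketch assumes the sequence is in frame from its first base.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments are shared by the
# standard code and translation table 11.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate(block_text: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon.

    Whitespace (spaces, newlines) in block-formatted input is ignored.
    Codons containing ambiguous bases are rendered as 'X'.
    """
    seq = "".join(block_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence from the record.
    print(translate("ATGGCACATA AAAAGAAAAT TAAAGACAAC"))
```

The first 30 bases translate to MAHKKKIKDN, which matches the first block of the protein sequence.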