Gene Athe_1822 details


Gene Information

Locus tag: Athe_1822
Symbol:
ID: 7408936
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1894647
End bp: 1896032
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 34%
IMG OID: 643716199
Product: O-antigen polymerase
Protein accession: YP_002573688
Protein GI: 222529806
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3307] Lipid A core - O-antigen ligase and related enzymes
TIGRFAM ID:
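
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1894647, 1896032)
print(length)                  # 1386
print(protein_length(length))  # 461
```

Both values match the Gene Length and Protein Length fields listed for Athe_1822.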


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.831736
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTGAAA AGAAATCTTT ATATATCACC TCTTTTTTGA TATTTGTGCT GGGAATTGCA 
AGCAGTTTAA TTTCTGTAAA GTTAGGTATA CTTGTACTTG GTCTGTTTTT GCTTGTGCTG
ATAATGTTAG AAGACCCATC TAAGTTGATA TATGGTGTTG TACTTTATGC TTTTGTGGAT
TTTCTTTTCA GAAAACTTTC TATTTTAAGC AGTTTTGCTT CTGTATGGGA TGAAGCGCTA
TTTTTGATAA TTGTCTTTGC GTTCTTATTA AAGTCAATTA TAAAAAATCA ATCAAGTTTG
AGATTTTCCC CTTTAGATGT ATATATTCTG ATTTTTTTGC TTGTCTGTGT GTTTTTACTG
TTTAAAAATT CGCCTGATAT GAGAATAGCA TTAGAAGGAT TTAGAGTATA TGCAGAGTAC
GCTTTATGGT TTTTCGTAGG ACTGTGGCTT TTAAAAGACG CAAGGCAATT TGAAAGGATT
ATTACAATAT TTATCCTGAT GATGTTTATA ATATCCATCT ACGGCATATA CCAGTATATT
ATTGGTGTTG AAATACCTTC AAGTTGGATT GATAGCAGTA GAGAAACGTA TATAAGAACA
AGGGTATTTT CAATTATAGG AAGTCCAAAT GTCCTTGGAA GTCTTTTGGC AATGTCAATA
CCTTTTGTTC TTCCTTACGT CCTTTATGAG AAAAATATTA AAAAGAGAAT TTATTATTCG
GTTGTATTAA TTTCAATGAT TGCCTGCCTT GGGTTTACAT TTTCAAGAGG AGCATGGCTT
GCATTTTTGT TTTCAATGCT TCTTTTTGGT TTTTTCATAG ACAAAAAAGT TTTGGGTATT
CTCTTTGCCA TCTTTACATC AGTTCCTATT TTGGCACCTT CAATTGTAAT GAGAGTGCTT
TATATGCTAA GCTCTGAGTA TGCCAAAAGC AGCGCAAGGG CAGGAAGAAT TGCCCGCTGG
ACAAAAGCAT ACGATATTTT GACACAGCAT CCTCTCTTTG GGGTTGGATT TGGAAGATTT
GGCGGTGCGG TTGCAAAGAG AAATATAGCC AATGCGTTTT ACGTAGATAA TTTTTATCTT
AAAAGTGCTG TCGAAATGGG AATCATTGGA GTAGGTATTA TGATTTTGGT GTTTATAGTA
GGGCTTTTGC TTGCTGCAAG AACTGTAAAA CATCTGCGCT CAAAAGAACT TAAGAATATA
GCAAGCGGAG TGCTCATTGG TCTTGCAACA GTCTTGATGC ATAACGTGGT TGAAAACATT
TTTGAAGTCC CAATGATGAC AACATATTTC TGGCTGTTTT TGGGATTTTT GTTTGCTCTC
AAATCAGCAG AAGATAAAAG CCAATTTTCT GAACAAAGCA ATATTTGCAA CAATGGGGGA
AGCTAA
 
Protein sequence
MVEKKSLYIT SFLIFVLGIA SSLISVKLGI LVLGLFLLVL IMLEDPSKLI YGVVLYAFVD 
FLFRKLSILS SFASVWDEAL FLIIVFAFLL KSIIKNQSSL RFSPLDVYIL IFLLVCVFLL
FKNSPDMRIA LEGFRVYAEY ALWFFVGLWL LKDARQFERI ITIFILMMFI ISIYGIYQYI
IGVEIPSSWI DSSRETYIRT RVFSIIGSPN VLGSLLAMSI PFVLPYVLYE KNIKKRIYYS
VVLISMIACL GFTFSRGAWL AFLFSMLLFG FFIDKKVLGI LFAIFTSVPI LAPSIVMRVL
YMLSSEYAKS SARAGRIARW TKAYDILTQH PLFGVGFGRF GGAVAKRNIA NAFYVDNFYL
KSAVEMGIIG VGIMILVFIV GLLLAARTVK HLRSKELKNI ASGVLIGLAT VLMHNVVENI
FEVPMMTTYF WLFLGFLFAL KSAEDKSQFS EQSNICNNGG S
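
The sequences on this page are printed in space-separated blocks of ten characters, so they have to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned and its GC fraction computed. The helper names are hypothetical, and only the first printed line of the gene sequence is used as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First printed line of the gene sequence (six blocks of ten bases).
first_line = "ATGGTTGAAA AGAAATCTTT ATATATCACC TCTTTTTTGA TATTTGTGCT GGGAATTGCA"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this first line only
```

Passing the complete gene sequence to the same two functions would give a value to compare against the GC content field of the record.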