Gene Athe_0462 details

Gene Information

Locus tag: Athe_0462
Symbol:
ID: 7407540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 528197
End bp: 529543
Gene length: 1347 bp
Protein length: 448 aa
Translation table: 11
GC content: 34%
IMG OID: 643714850
Product: exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
Protein accession: YP_002572367
Protein GI: 222528485
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2148] Sugar transferases involved in lipopolysaccharide synthesis
TIGRFAM ID: [TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase
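
The start, end, and length values listed above agree with each other under two assumptions that the record itself does not state: that the coordinates are 1-based and inclusive, and that the protein length excludes the stop codon. A minimal Python sketch of that arithmetic (variable names are illustrative, not part of the record):

```python
# Values copied from the record above.
start_bp = 528197
end_bp = 529543

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1347
print(protein_length)  # 448
```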


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00445069
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATCAT ATATTAAAAG CTATCACAAG CTGAGCAAAT TTTTTGTTGT TCTTATGGAC 
ATTTTACTTG TACATGCAGG TTATATAATT GCCTATATTA TAAAATTCAA CTTTACATTT
CCAGAGAAAA ATTTCATGCC TTATTATACA TTAATTCCTC AAATAACTCT TTTTGCTCTT
GTTCTTTTAA ATATTTATGG ACTTTATACT ATTACCATGA AAACCATTAG CGAAATAGCA
TTTTCGCTGG GTCTGGCTCT AATGCTTCTA CAATTTTTAA CAGTTGTATC AACATTTTTC
TACAGACACT TTGCTTTCCC ACGCAGTATA TTTATAATTG CCTTTTTTGT ACAGTTTTTA
TTGCTTTTGG GATGGAGAGG ACTTGTCCTA TATGTTTTCA AAAGAGTTCA AGGTGTCAAG
CATGTACTTG TGATTGGAGA GATGTCAAAG GCGCAGGAGT TTGCTCAAAA GCTTCAAAAT
ATTTCGAAAG GTTGGATAGA TGTAAAGTAT GTTCTTGAGC CAAAAGCTAT AGAAGAATTG
ATACTATATA TAAAACTTGT TGATACAATT TATATTTATT CAAAAATGGA TGAAAACTTA
AAAAGTGAGA TTGTAAGAAA GGCGATAGAA TTCAAAAAAC ATATTTTCAT AGCACCAGAT
TTTAGAGATA TATTGGTATC ACGTGCGAGG GTGATTCAGT TTGATGATGT TGCAACACTT
TCAATTGAAC AGCCAGAACT TACTTCCGAG CAAAAGCTTA TAAAAAGATT TTGTGACATT
CTTCTTGCAT CAGTTGCGCT GGTTATTTCT TTTCCTATAA TGATCTTGAT TGCAATTGCC
ATAAAAATTG ACTCAGAGGG GCCAGTAATT TACAAGCAAA AAAGGGTCAC AGAAGGAGAG
AGAGAGTTTT ATGTTTTAAA GTTCAGAACA ATGGTAAAAG ATGCAGAAAA GATGACAGGT
CCTGTTCTGG CAACCGAAAA CGACCCCAGA ATAACAAGGG TTGGAAGGTT TTTGCGCGCA
ACAAGGCTTG ATGAGCTTCC GCAGCTGATA AATATTTTAA AAGGTGAAAT GAGTTTTATA
GGACCAAGAC CAGAGCGTCC TTATTTTGTT GAGCAGTTTA AAAAACTCTA TCCTGAGTAT
TCGCTTCGTC ATAATGTAAA GGCAGGGCTC ACAGGACTTG CCCAGGTTTA TGGCAAATAT
GCAACAAGCC CTGAAGACAA GCTCAGGCTT GATTTGATAT ATATAAAGAA TTACTCTGTA
TTTTTAGACA TCAAAATTTT ACTGTTGACC TTAAAAACAA TTTTTACCAA AGAGGCAGCT
GAGGGGGTAA AAAACCAAAA AGGATAG
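
The GC content reported in the gene information can in principle be recomputed from the sequence above. The sketch below assumes GC content means the fraction of G and C bases over the full gene sequence; how the record rounds or computes its own figure is not stated. The helper name `gc_percent` is hypothetical, and the example call uses only the first 10-base block of the sequence.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (the 10-base block separators and line breaks used in
    the listing above) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence: one G and one C out of ten.
print(gc_percent("ATGAAATCAT"))  # 20.0
```

Passing the full gene sequence as a single string (spaces and newlines included) gives the whole-gene value to compare against the reported figure.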
 
Protein sequence
MKSYIKSYHK LSKFFVVLMD ILLVHAGYII AYIIKFNFTF PEKNFMPYYT LIPQITLFAL 
VLLNIYGLYT ITMKTISEIA FSLGLALMLL QFLTVVSTFF YRHFAFPRSI FIIAFFVQFL
LLLGWRGLVL YVFKRVQGVK HVLVIGEMSK AQEFAQKLQN ISKGWIDVKY VLEPKAIEEL
ILYIKLVDTI YIYSKMDENL KSEIVRKAIE FKKHIFIAPD FRDILVSRAR VIQFDDVATL
SIEQPELTSE QKLIKRFCDI LLASVALVIS FPIMILIAIA IKIDSEGPVI YKQKRVTEGE
REFYVLKFRT MVKDAEKMTG PVLATENDPR ITRVGRFLRA TRLDELPQLI NILKGEMSFI
GPRPERPYFV EQFKKLYPEY SLRHNVKAGL TGLAQVYGKY ATSPEDKLRL DLIYIKNYSV
FLDIKILLLT LKTIFTKEAA EGVKNQKG
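
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is a minimal illustration, assuming the amino-acid assignments of NCBI translation table 11, which are the same as the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and only the first and last stretches of the gene are translated.

```python
# Codon table in the conventional NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence above.
print(translate("ATGAAATCAT ATATTAAAAG CTATCACAAG"))  # MKSYIKSYHK
print(translate("GAGGGGGTAA AAAACCAAAA AGGATAG"))     # EGVKNQKG
```

Both outputs match the first and last residues of the listed protein sequence. The sketch raises `KeyError` on ambiguous bases such as N, which do not occur in this gene.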