Gene Athe_1310 details


Gene Information

Locus tag: Athe_1310
Symbol:
ID: 7408891
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1399394
End bp: 1400509
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 34%
IMG OID: 643715675
Product: Thiamin pyrophosphokinase catalytic region
Protein accession: YP_002573183
Protein GI: 222529301
COG category: [S] Function unknown
COG ID: [COG4825] Uncharacterized membrane-anchored protein conserved in bacteria
TIGRFAM ID:
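
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1399394
end_bp = 1400509

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1116
print(protein_length)  # 371
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.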


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAAAAG GGAAGGTTAG AGTTGATAGA AGAACTAAAA ACTTGGTAAG AAGACTAAGA 
CCAGGTGAAA TACCTGTTAT AATGCACGAG GATATTGATG AGGTAGCTGC ATATTCTCTT
TTGGAGAAAA AAGTGAGGGT TGTAATCAAT TGTGCTAAAT CTTTTACAGG GAAATTTCCA
GCAGTTGGTG CAAAGATTCT TCTTGCACAT GATGTGATCA TAATTGATAA TTTAGGGGAA
GATGTATTTA ATCGAATAAG AGAAGGGGAC GTTGTAGAAA TCGAAGATGA TAAGATATTT
TTAAATGGTA ATTATCTATG CATTGCAAAA TATCTTACTA AAGAAGAATT TGAATCATTT
TATCAAAAGA GTTTCAAAGA AATGGAAAAT CTTTTGGAAG ATTTTATAGA AAATACATTG
GAGTATGCAA AAAAAGAAAA AGGATTTATC TTAGGACAAT TTGAAATGCC TGATATTTCA
ACTAAAATTG CTGGCAGACA TGTACTTGTT GTGACAAGAG GAAGCAGTTT TAAAAAAGAT
ATAAAAGCAA TAAAAGGTTA TATTACAGAG GTAAAACCAG TTGTGATTGC AGTTGATGGC
GCTGCTGATG CATTGCTTGA GGAAAAGATA AGACCAAACA TTATAATTGG GGATATGGAT
AGTGTATCTG AAGAAAGTCT TTACAAATGT GACGAGATAA TTGTTCATTC ATATCCAAAT
GGATATGCAC CAGGGCTAAG AAAAATACAG GCTTTAGGAC TTAAGGCAAA AACAATAGCA
TGCCCTGGTA CGAGTGAAGA TGTTGCTTTG CTTTTGGCTT ATGAAAAGGG GGCAGAACTT
ATAGTTTCGG TTGGTTCTCA CAGCAGTATG CTTGATTTTT TAGAGAAAGG TCGAAAAGGA
ATGTCAAGCA CTTTTCTGGT CAGGCTAAAA ATAGGTTCAA AGCTTGTGGA TGCAAGAGGT
GTATCCAAGC TTTATACTGA AAAGGTAAGT TTCAAGTATA TTGGGGTTTT GTTGTTTTCT
GCACTTATTC CTATACTTGC AATCCTGATG GTAACTCCGC CTTTTCAATA CTTTTTCTAT
TTAATTCAAC TAAAACTGAG AGTAATCTTG AGGTAG
 
Protein sequence
MIKGKVRVDR RTKNLVRRLR PGEIPVIMHE DIDEVAAYSL LEKKVRVVIN CAKSFTGKFP 
AVGAKILLAH DVIIIDNLGE DVFNRIREGD VVEIEDDKIF LNGNYLCIAK YLTKEEFESF
YQKSFKEMEN LLEDFIENTL EYAKKEKGFI LGQFEMPDIS TKIAGRHVLV VTRGSSFKKD
IKAIKGYITE VKPVVIAVDG AADALLEEKI RPNIIIGDMD SVSEESLYKC DEIIVHSYPN
GYAPGLRKIQ ALGLKAKTIA CPGTSEDVAL LLAYEKGAEL IVSVGSHSSM LDFLEKGRKG
MSSTFLVRLK IGSKLVDARG VSKLYTEKVS FKYIGVLLFS ALIPILAILM VTPPFQYFFY
LIQLKLRVIL R
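
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the method used to produce the record: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the codon table uses the standard amino-acid assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons, which this sketch does not model).

```python
# Build a codon table from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()

def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)

def translate(dna):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First line of the gene sequence above (60 bases).
first_line = "ATGATAAAAG GGAAGGTTAG AGTTGATAGA AGAACTAAAA ACTTGGTAAG AAGACTAAGA"
print(translate(first_line))  # MIKGKVRVDRRTKNLVRRLR
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_percent` gives the value to compare against the GC content field.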