Gene Athe_0250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0250 
Symbol 
ID7407567 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp300839 
End bp302107 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content36% 
IMG OID643714650 
Producthypothetical protein 
Protein accessionYP_002572173 
Protein GI222528291 
COG category 
COG ID 
TIGRFAM ID[TIGR02679] conserved hypothetical protein TIGR02679 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAAAG ATAAGATTTT CGATGAGTGT GTAGAGTACT TTTCAAAGCC AGGATTTAAG 
CGTGCGCTGA AGCTTATTCA TAGTAAGTAT AGGTCTTTGG GGCGATTTTC TGGCAAGATT
ATTTTAGAAA ATCCATCTGA AGAGGAAAAA GAGACTTTAT CGCGGTATCT TAGAAGGGTT
TTGAGAGGCG AGAAGGTTGT CATTGATGTA AAAGACTTTA CCGTGACAAA GTTTCAGGAC
ACTAAGTTCT CAGGGCTTGA TTTTAAAAGC ATTCTGTCAG CAGTTTTGAG AAAAGAGGTT
ATCACCAAAA AAGAAGAAAA GGAGTTGAAA AGTGGAAGGA TATTAAAGTT TTTTAAAAGT
TTGTCAGCAC ATTTTGAGGG TGATGAAAAT GCTGCAGAGG TTTTAAATGC TTTTAAAGAG
AATTTCAAAT CATTTGAGAG CTTTTATAAA AAGTATTCAC AAGAAGAGTT TTTAGAGATA
ATGAAAAAGG TCATAGAAGC AATTTTAAAA AAACCACAAA GCCCTGAGAC TTTGGCTATT
TTTGCAACAA GGGTTACAGG CAACCCTCAC TTTTTTGATG ATGAGCAAGA TGCAGGAAAG
ATATTTTTAA AGCTTTTGAG CATTATAAAC GGTAGAGAGT TTCCCCAAAA TGCAGAGGAA
AAATCAGAAC TACTTTTTGG TAACAACATC TTAATTGATG AACTTTCAAA CTGGTGCCTT
TTGTATAACA TTGGCGGGTA TATTGAAGAT GGAAAAGAAG ATGAAGGGCT CAAGTACTTT
AGCAATCAAA AAAAGCCTAT TATCTTACCA CTTTATACTA TAAAGGATTA TAAAGGATTT
TTTGCATACT CAAATAAGCT TGTGGTTGTT GAAAACCCTG CTGTATTTTC TGCGATTGTG
CAAAGAGTCC CAGCTATTTC TGCTGTGTGC ACAAATGGGC ACCTGAGGCT CTCAAGCAAG
ATAATCATTG GAAGCATTGC AAAGACAAAT ATATCTTTGC TGTACTCAGG CGACTTTGAC
CCAGAAGGGC TTTTGATTGC AGACAGAGTA ATTCAAAACT TTGGTGCAAT GCCACTTTGT
ATGGATGAAG TCCACTATTT TTTGGCACTG TCTGAAAATA AGATAGATGA AAGGCGCTTA
GAGATGTTAA AGAATGTAAA AAGTGCTCAG CTACAAAGCG TCTGCAAGAA AATGAAGGAG
CTTCAGCTTG CTGGGTATCA GGAGAGGATT GTGGATAGGA TTGTTGAGAA GCTAAAAGTT
AATATTTAA
 
Protein sequence
MTKDKIFDEC VEYFSKPGFK RALKLIHSKY RSLGRFSGKI ILENPSEEEK ETLSRYLRRV 
LRGEKVVIDV KDFTVTKFQD TKFSGLDFKS ILSAVLRKEV ITKKEEKELK SGRILKFFKS
LSAHFEGDEN AAEVLNAFKE NFKSFESFYK KYSQEEFLEI MKKVIEAILK KPQSPETLAI
FATRVTGNPH FFDDEQDAGK IFLKLLSIIN GREFPQNAEE KSELLFGNNI LIDELSNWCL
LYNIGGYIED GKEDEGLKYF SNQKKPIILP LYTIKDYKGF FAYSNKLVVV ENPAVFSAIV
QRVPAISAVC TNGHLRLSSK IIIGSIAKTN ISLLYSGDFD PEGLLIADRV IQNFGAMPLC
MDEVHYFLAL SENKIDERRL EMLKNVKSAQ LQSVCKKMKE LQLAGYQERI VDRIVEKLKV
NI