Gene Athe_1336 details


Gene Information

Locus tag: Athe_1336
Symbol:
ID: 7408917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1422403
End bp: 1423644
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 33%
IMG OID: 643715701
Product: stage IV sporulation protein B
Protein accession: YP_002573209
Protein GI: 222529327
COG category:
COG ID:
TIGRFAM ID: [TIGR02860] stage IV sporulation protein B
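
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the reported gene length, and that the protein length excludes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, not counting the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1422403, 1423644)
print(length, protein_length(length))  # 1242 413
```

Both results agree with the Gene Length (1242 bp) and Protein Length (413 aa) fields.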


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAAAAAC TGGCGGGTGG ACTTTTTCTT TTCTATATTC TAATTACCAC CTTTTTAGTC 
ATCTATCTTT ACATTACTCC TGATTGTCTT ACTTGTTACA GCTCAGACAA AGCTATTACT
ATTAAAACTC CTATTTTTGT TAATGTTAAT CTAAGTCCTT CAAATGTTCA AACCAGAACA
CAAATAAAAT TTCTTTACAG AACAAATAAG ATTTATATTC CAAAGAAGCA TTCTTCAGTT
TTATGCGAGC TAAAAATTGG TACAATACCG CTCAAAAAGG TAAGAATCTC TATTCTTGAA
TCAAACAAGG TCTGGGTTTC TGGGAAATTT ATAGGAATCA AGCTTATGAC AGATGGAATA
CTTGTTATAG GATATTCTTA CGTAAGTAAT GGTAGTAATT CAACTTCACG AGTTCCTGCA
AAAGAAGCAG GTATCCAAAT AGGTGATAAG ATTGTATATG TAAATGGGCT GAAGGTAAAA
GACTGCAATC AGCTTTTTAA AATCATAAAC TCATCAGGTG GCAAGTCCTT AGTTTTTGTA
ATCAAAAGAG GACAAACCTA TAAACAGTTC AAAGTAAAAC CACTTCTAAG TAACGAAGGT
GTATACAAAA TAGGACTGTG GGTCAGGGAT GGTACAAGTG GCATTGGAAC AGTTACATTT
GTAGATACCA AAAGAAAGGT TTTTGGTGCT CTTGGCCATG GTATATCAGA CATAGACACA
GGTATTCTTC TGGATGTGAA AGAGGGACAA ATTTATTCAG CCGAAATAGT TGATATAAGA
AAAAACGATA AAAGTGAGAT TGGCGAAGTT GTTGGCAAAA TCAATGAAAA CTGTGTAGTT
GGCGATGTGA TTATTAATAC TCCATACGGG ATTTATGGTA AAATAATTCA AAATGGTTTT
TGGGATAGCC TTCAAAGTAT CGAGATTGCC CGACTTCAGG ATATTCACGT AGGTAGTGCA
TATATTTTAA GTGAAGTTTC AGGGAATATT GAAAAGTTTG AAATAAAAAT AGAAAGAATT
TTGCCTCTTT ATAGAAATTC GACAAAAGCA TTTGTTATAA GGATTACTGA TAAAAGACTT
CTTCAGCTCA CATCTGGAAT TGTTCAAGGA ATGAGTGGCT CTCCAATTAT TCAGGATAAT
AAACTTGTCG GAGCTATTAC TCATGTTTTT TTGCAAGAAC CAGAAAGAGG ATACGGTGTT
TTTATTGATA ATATGCTAAA TATCACAAAA TATATCAAAT AA
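
The sequence above is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. A minimal sketch of that cleanup, together with a GC-content calculation of the kind behind the GC content field, might look like the following. The helper names are hypothetical, and the example runs only on the first line of the gene sequence rather than the full 1242 bp.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence shown above.
first_line = "TTGAAAAAAC TGGCGGGTGG ACTTTTTCTT TTCTATATTC TAATTACCAC CTTTTTAGTC"
print(len(clean_sequence(first_line)))
print(round(gc_content(first_line), 1))
```

Passing the full gene sequence to `gc_content` gives the value to compare against the GC content reported in the record.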
 
Protein sequence
MKKLAGGLFL FYILITTFLV IYLYITPDCL TCYSSDKAIT IKTPIFVNVN LSPSNVQTRT 
QIKFLYRTNK IYIPKKHSSV LCELKIGTIP LKKVRISILE SNKVWVSGKF IGIKLMTDGI
LVIGYSYVSN GSNSTSRVPA KEAGIQIGDK IVYVNGLKVK DCNQLFKIIN SSGGKSLVFV
IKRGQTYKQF KVKPLLSNEG VYKIGLWVRD GTSGIGTVTF VDTKRKVFGA LGHGISDIDT
GILLDVKEGQ IYSAEIVDIR KNDKSEIGEV VGKINENCVV GDVIINTPYG IYGKIIQNGF
WDSLQSIEIA RLQDIHVGSA YILSEVSGNI EKFEIKIERI LPLYRNSTKA FVIRITDKRL
LQLTSGIVQG MSGSPIIQDN KLVGAITHVF LQEPERGYGV FIDNMLNITK YIK
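
The gene sequence begins with TTG, while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), TTG is one of the alternative initiation codons and is read as methionine when it starts a CDS. The sketch below illustrates this with a hand-built codon table. It is an assumption-laden illustration, not the database's own translation routine: the internal codon assignments are those of the standard code, and `START_CODONS` lists only a subset of the initiation codons permitted by table 11.

```python
BASES = "TCAG"
# Standard codon assignments, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Subset of table 11 initiation codons (assumption: enough for this example).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an alternative start codon as M."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is translated as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGAAAAAAC TGGCGGGTGG ACTTTTTCTT"))  # MKKLAGGLFL
```

The output matches the first ten residues of the protein sequence. The final twelve bases of the gene, TATATCAAATAA, translate to YIK followed by a stop codon, matching the protein's C-terminus.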