Gene Athe_1789 details


Gene Information

Locus tag: Athe_1789
Symbol:
ID: 7408576
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1862016
End bp: 1863149
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 33%
IMG OID: 643716166
Product: stage II sporulation protein P
Protein accession: YP_002573655
Protein GI: 222529773
COG category:
COG ID:
TIGRFAM ID: [TIGR02867] stage II sporulation protein P
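
The coordinate and length fields of this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are inclusive 1-based coordinates and that the reported protein length excludes the terminal stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1862016, 1863149)
print(length_bp)                  # 1134
print(protein_length(length_bp))  # 377
```

Under these assumptions both results agree with the Gene Length (1134 bp) and Protein Length (377 aa) fields.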


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTAAGG TTGTTGATTT TAAAAAGGTA GTATTGGTAA CTACTATACT TTTTGTGGTT 
GGTGTTGGCT TTTTAGTTGA AAGATTAGTT TTTTCAAATC AAGTAGCCAC AGCAGTGCTT
TTCAGCTATT CAAAAGAGAT TATTTCATTT AATATACCAA TTTTTTCTGA CCATTTTGCA
AACAAGATTA TTAAGATTGA AAATATAGTA AGGTTTTCTT ATCCGATGTT TGCTGGAACA
AACTTTCAAG AAGTTGAAGG TGTACCTTTA TATGAAGACG ATGCTATTAT GATAGATTAT
AATCAGCAGA CCCAAGAAGA TAAGAAAAAT GTTCAATCAG AAAGCCAGAA TGAAAATATC
GAATTTCAGA AATACTTTAC AAATAGTACT CAAAAAGTTG GGACCTGCAA TAACATAGAG
ATTATGAATC AGACAGATTA TAAAATTGAT GCTAATATCC TTTTGAAGAC AAATTTCAAA
ATCTTCAATG GAAAAAAACC GTCCATTTTA ATTTACCATA CTCACACAAC AGAAAGCTAC
AATTCTTTTT CTCAAAACCT TGTATACACT CCTGGCACAA CAGACAGAAC ACTTGACTTT
AACTACAACG TTGTGAGAGT AGGGGAGGAG TTAAAAAGAA TCTTAGAAAA ACAATATGGT
TATAAAGTTT ATCACAGCAA AGATGTAAAT GATTATCCAG AATACAAGGG TTCTTATTCG
CGGTCATTGA AGGTAATAGA GAAATATAAA AGTGAACATC CTGATATAAA AGTCTTTATA
GATTTACACA GAGATGCTAT TGGAAATGGT TCAAAAAAAG TAAAGGTTTC AACAGTTGCG
TTTGGATATG AGGTTGCAAA GGTAATGCTT GTTGTAGGGA CAGACAAGCT TGGGCTTTAT
CATCCTTTTT GGCGACAGAA CCTTCTGTTT GCTGTGCATC TTCAAAAAAA TCTCAGCAAA
ATATGCCCTC AGATTACAAG ACCTATAAAC CTCTCTGCTG CACGATACAA TCAACATGTA
TCACCATATG CTATAATCAT TGAAATTGGT AGCAATGGGA ATACCTTAGA AGAAGCCTTA
AGGAGTTGCC AGATTGTTGC AAAGGCATTG GATGATACTA TCATGGGAAG GTGA
 
Protein sequence
MVKVVDFKKV VLVTTILFVV GVGFLVERLV FSNQVATAVL FSYSKEIISF NIPIFSDHFA 
NKIIKIENIV RFSYPMFAGT NFQEVEGVPL YEDDAIMIDY NQQTQEDKKN VQSESQNENI
EFQKYFTNST QKVGTCNNIE IMNQTDYKID ANILLKTNFK IFNGKKPSIL IYHTHTTESY
NSFSQNLVYT PGTTDRTLDF NYNVVRVGEE LKRILEKQYG YKVYHSKDVN DYPEYKGSYS
RSLKVIEKYK SEHPDIKVFI DLHRDAIGNG SKKVKVSTVA FGYEVAKVML VVGTDKLGLY
HPFWRQNLLF AVHLQKNLSK ICPQITRPIN LSAARYNQHV SPYAIIIEIG SNGNTLEEAL
RSCQIVAKAL DDTIMGR
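
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how such a listing can be processed, the Python below joins the blocks, computes a GC fraction, and translates codons using the amino-acid assignments that NCBI translation table 11 shares with the standard code. Only the first line of the gene sequence is used as sample input, so its GC fraction is not the whole-gene figure of 33%. All function names are hypothetical.

```python
from itertools import product

# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed with bases in TCAG order and the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(blocked: str) -> str:
    """Remove the spacing and line breaks used in the blocked listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence from this record (sample input only).
first_line = "ATGGTTAAGG TTGTTGATTT TAAAAAGGTA GTATTGGTAA CTACTATACT TTTTGTGGTT"
dna = join_blocks(first_line)
print(len(dna))                    # 60
print(round(gc_fraction(dna), 3))  # 0.283
print(translate(dna))              # MVKVVDFKKVVLVTTILFVV
```

The translated sample matches the first twenty residues of the protein sequence listed above.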