Gene Athe_2601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2601 
Symbol 
ID7409560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2739216 
End bp2740691 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content39% 
IMG OID643716970 
Productstage IV sporulation protein A 
Protein accessionYP_002574439 
Protein GI222530557 
COG category 
COG ID 
TIGRFAM ID[TIGR02836] stage IV sporulation protein A 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000143605 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGCTG ATATCTACAG AGAAATAGCC AAAAGGACAA ACGGTGACAT TTACATTGGC 
GTGGTTGGAC CTGTCAGAAC AGGAAAGTCC ACATTTATAA AAAAATTCAT GGACCTTTTT
GTAATTCCTA ACATTGAAGA TGAGTATAAA AAAGAGAGAA CAAAGGATGA ACTACCTCAA
AGCGCACAGG GAAAAACCAT AATGACAACA GAGCCCAAGT TTGTTCCCAA TGAAGCTGTT
GAAGTTTTGC TATCAAGCGG TGCAAGACTG AAGGTACGGC TTGTTGACTG CGTTGGGTAC
CTTGTTGAAG GTGCAATGGG ACATTTGGAA GAAAACCACC CGCGCATGGT CACAACACCG
TGGTTCGAAA AGCCAATCCC GTTTGAAGAG GCAGCAGAGA TTGGCACTAA AAAGGTCATC
CAGGACCACT CAACGATTGG GGTAGTCATT ACAACAGATG GCACAATTAC TGATATACCA
CGAGAAAACT ACATAAAGGC AGAAGAGAGA GTGATTGAAG AATTAAAACA AATCAACAAA
CCATTTGTGA TTGTTCTCAA CACCGCAAAA CCCTACTCAC CCGACACACA AGAGCTCAAA
AAAGATCTTG AAGAGAAGTA CAAAATGCCA GTTTTGATTG TCAACTGTCT TCAGATGCAG
ATTGAGGATG TAAAGAGGAT TTTAGAGACA GTACTGTTTG AGTTTCCTAT TGTTGAAGTA
AAGATTAACC TGCCCAGATG GTTTGACGAG ATGGAAGATG AGTCGTGGCT CAAAAAAGAG
ATTTATGAAA AGATAAAAGA ATATGCAGAA AAACTTAACA AAATCAGAGA TATAACAGAC
CAGCTTGAAG TTTTAAAACA ACTTCCCCAG ATTGACAGGT GTGAGGTTGT AGGAATAAAT
TTGGGAGATG GGAAGAGCGA GCTTTCAATT TATTTCAAAG AAGGACTTTT GTTCAGGATT
ATTGAGGAGT TCACTGGATT TGAAATAAAA GGCGATCATC ACTTATTAAA ACTTCTTTCT
GAACTTGCAC AGGTGAAGAA AGACTATGAT AAACTCAAAG ATGCGCTTGA AGAAGCAGGT
GAAAAAGGTT ATGGTATTGT ACCACCATCC TTAAACGAAC TTAAGCTTGA AACTCCTGAG
ATAGTCAAAA GAGGAAATAG TTTTGGTGTA AGACTAAAGG CATCAGCTCC CTCTCTTCAC
ATTATAAGAG TTGAGGTTGA AACAGAGGTA TCACCAATTG TTGGAACAGA AAAACAAAGT
GAAGAGCTTG TAAACTTTTT GATGAGAGAG TTTGAAGATG ACCCAAAAAA GATTTGGGAG
TCCAATATAT TTGGAAAATC TCTGCACGAG CTTGTTAAGG AAGGCTTGCA AAACAAGCTA
TACAGAATGC CAGAAGATGC TCAAGCAAAG TTGAAAGAAA CATTGCAAAA AATTATAAAT
GAGGGAAGTG GCGGGCTTAT TTGTATTATA CTTTAA
 
Protein sequence
MDADIYREIA KRTNGDIYIG VVGPVRTGKS TFIKKFMDLF VIPNIEDEYK KERTKDELPQ 
SAQGKTIMTT EPKFVPNEAV EVLLSSGARL KVRLVDCVGY LVEGAMGHLE ENHPRMVTTP
WFEKPIPFEE AAEIGTKKVI QDHSTIGVVI TTDGTITDIP RENYIKAEER VIEELKQINK
PFVIVLNTAK PYSPDTQELK KDLEEKYKMP VLIVNCLQMQ IEDVKRILET VLFEFPIVEV
KINLPRWFDE MEDESWLKKE IYEKIKEYAE KLNKIRDITD QLEVLKQLPQ IDRCEVVGIN
LGDGKSELSI YFKEGLLFRI IEEFTGFEIK GDHHLLKLLS ELAQVKKDYD KLKDALEEAG
EKGYGIVPPS LNELKLETPE IVKRGNSFGV RLKASAPSLH IIRVEVETEV SPIVGTEKQS
EELVNFLMRE FEDDPKKIWE SNIFGKSLHE LVKEGLQNKL YRMPEDAQAK LKETLQKIIN
EGSGGLICII L