Gene Athe_2152 details

Gene Information

Locus tag: Athe_2152
Symbol:
ID: 7408345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2283344
End bp: 2284462
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 36%
IMG OID: 643716517
Product: GTP-binding signal recognition particle SRP54 G- domain protein
Protein accession: YP_002574000
Protein GI: 222530118
COG category: [N] Cell motility
COG ID: [COG1419] Flagellar GTP-binding protein
TIGRFAM ID: [TIGR03499] flagellar biosynthetic protein FlhF
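
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The constant and function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2283344
END_BP = 2284462
GENE_LENGTH_BP = 1119
PROTEIN_LENGTH_AA = 372


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Amino-acid count, assuming one terminal stop codon is excluded."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


print(span_length(START_BP, END_BP))   # 1119, matches Gene Length
print(coding_codons(GENE_LENGTH_BP))   # 372, matches Protein Length
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length fields listed in the record.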


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAATCA AGAGGTATTT GGCACATGAT ATGCAGGAAG CATTAATAAG AATAAAAGCA 
GATTTGGGTA AGGATGCAGT TATACTTTCA ACCAAAAAGG TAAGACAGAA AGGTCTATTT
GGTTTTTTCA AAAGACCCTT AATAGAGGTT ACAGCTGCAT GCGAGGATGA AAAGATAGTC
AAAAAAGAGG AGGAATCTAT AAAACAGGAA AGTTTGGCAT TGGGCTTCCA ACTGACCCAG
ATAAAAGAAC TTGAAAGAAA GATTGATTCT CTTGAAAAGG TTTTAAAAGA GGTTATAAAG
AAAGAGCAGG AAGAAGATAT TAGTCAGACA AAAGAACTTA GTAAGAAAAA TTTTATTGAT
GTTATGAGAG AAAATTTAAT AAAAAATGGC GTGGAAAGTG AAATTATAAA TATGCTGTTT
TCAAACCTAA GCGGAGAAGC TTCAATAAAC AACGTAGTAA ACAATATATA CAAGGGAATA
AAGAATATGC TCGGAGTGGC AGCACCGCTT TCATTCAACT CAAAAATTCC AAAGATTGTG
TTTTTTGTAG GGCCAACAGG TGTTGGCAAG ACAACCACAA TTGCCAAGAT TGCTGCTAAA
CTCATGTTTG AGGATGGGAA AAAGGTAGGA TTTATTACAG CAGATACATA CAGAATTGCT
GCGGTTGAAC AGCTTAAGAC ATATGCTGAG ATTATGAATA TCAAGACCAA GGTGTGGTAT
GAGGTGGACG AGTATGATAG AATAATTGAA AACTTTTCTG ACTCAGATGT AGTTCTTGTT
GACACTGCGG GAAGAAGTCA CAAGAATCAG GAACATATGG ACGAACTAAA AGCGTTTGTC
GCCAAAGCAA ATCCAGATGA AGTGTTTTTG CTTCTTAGTG CAACAACCCA GCCGTCGGTG
TTCAAAGAGG TGGTGAATAC CTATTCATTT TTAAATGATT ATAAAGTTAT CGTAACCAAA
GTAGATGAAG TATCAACTTA TGGGAACATA TTAAATATCC GCTACTTTAC ACAAAAGCCA
ATTGCGTATA TAACGACTGG TCAAAATGTA CCTGATGATA TTGAACAGTT TAACCCTGAA
CAATTTGCAA AACTTATCAT AGGGAGTAAG GTTTTATGA
 
Protein sequence
MRIKRYLAHD MQEALIRIKA DLGKDAVILS TKKVRQKGLF GFFKRPLIEV TAACEDEKIV 
KKEEESIKQE SLALGFQLTQ IKELERKIDS LEKVLKEVIK KEQEEDISQT KELSKKNFID
VMRENLIKNG VESEIINMLF SNLSGEASIN NVVNNIYKGI KNMLGVAAPL SFNSKIPKIV
FFVGPTGVGK TTTIAKIAAK LMFEDGKKVG FITADTYRIA AVEQLKTYAE IMNIKTKVWY
EVDEYDRIIE NFSDSDVVLV DTAGRSHKNQ EHMDELKAFV AKANPDEVFL LLSATTQPSV
FKEVVNTYSF LNDYKVIVTK VDEVSTYGNI LNIRYFTQKP IAYITTGQNV PDDIEQFNPE
QFAKLIIGSK VL
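
The gene and protein sequences above are printed in blocks of ten characters. A minimal sketch of how the gene sequence could be joined, translated, and checked against the protein sequence follows. It assumes the amino-acid assignments of NCBI translation table 11 (the table named in this record), which are the same as those of the standard code; the function names are hypothetical, and alternative start codons are not handled (this gene starts with ATG).

```python
BASES = "TCAG"
# Amino acids for translation table 11, listed in TCAG codon order
# (TTT, TTC, TTA, TTG, TCT, ...). "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display blocks."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three display blocks of the gene sequence above.
first_blocks = "ATGAGAATCA AGAGGTATTT GGCACATGAT"
print(translate(clean_sequence(first_blocks)))  # MRIKRYLAHD
```

Passing the full gene sequence through clean_sequence and then gc_content or translate would allow the GC content and Protein sequence fields to be compared in the same way.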