Gene Athe_2167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2167 
Symbol 
ID7408360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2295739 
End bp2297277 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content32% 
IMG OID643716532 
Productflagellar hook-length control protein 
Protein accessionYP_002574015 
Protein GI222530133 
COG category[N] Cell motility 
COG ID[COG3144] Flagellar hook-length control protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGCACAC AGAACATTGT TAACACTAAT ATGCTGTTTT TAAAAACATT CTCTGCGACA 
TCAAAGACAA AAGATAGGCA GGAAAACAGT ATATCATTTA AAGATGTTTT CAAAAAGGCA
TCTGACATTG CAAAAGATAC AGACAATAAA AGTAATAGTT TACAAAAAAA TGCAGAAGAA
AGCTACAGAG CAGCTATTGT AACTTCTAAA AGACAATTTC AAAATGAGAA CCAAAACTTG
CACGTAGATA CACCTTTTTC TAAAGAATTG CCAGACGGTA CAAACTCAGA CTTAGAAAAC
AGGGCTGAAA TTCAATCATT GCAAGCTCAG GTGATGGAAT TTTTACAGCT AATTTTTAAT
TTGATACAAA GCGGTGAAAG CTTGGACAAG TTTAATTTAG ACAAGCTTTT TCAGAATAGT
AGCATCCAAG AGACAAACTT TTTAAATTCG CAGCTGCAAT CACTAAAGAT GGATGTGGAT
TTAGATCAAT ATTTAAATTT AAACATTAAT CAAAGCAATA AAACAACAAT TACGAAAATT
TTACACAATA TATTGCAAAA AATGGTTAAA GATACTCAAA ATCAGCAAGT TCAAAATTTT
ATCTATGTGC AAGGAAGTGA AGTTAATCGA GAAAATATTT TTGAGTTGTT AAAAGAGGTC
TTGCTTGAAA AAGAAGGGGA AAAACAGATA TTTGATGTTA AATCGGATGA TAGTTCATGG
GTTAAAAGTT TTATAAATCT TTTTGCACAG GAAGGTCAAA GCTTCAAAGT TTTGTCTGAA
GCAAGTGGTG GAAAAGAGAT TTTAAAGAAT GTTTTAAATG AGCTTGAGAA TATTGCTAAA
AGATTAAATG TGCAAAAAAT AGTAGATAGT TTTAATATGG AGAGTCCAGA GAGAGTACAA
ATAGCCGAAA AAGGGAACAG CAATTTTAAT GGTATAGGTT CAAATGATGG TGATTTTAAC
AAAGTTTTTT CAACCTTGTT GAAAAAAGAT GAAGGCAGTA GCATAAATGA CCAAAAAGGA
GAAGTTAAAA CCTTAGATTT GAGACAGCAT GTTTTTGCTT TTCAGAACAA GGTAGAGAAT
ATAGAGAACA CAACACCTTC CCAAAATGAC AGAATAATAA AAGATCTCAG AATGTCTATA
ATTAATCAGC TTGCAGAAAA AATTTCTGTA GTCAGTAGAC AGAATTTGAC TACATTGCAG
GTGAGCATAA AACCTGAGTG GCTTGGAAGT GTTGTGATTG AACTGAGCAA AGATAGTAGC
GGAAAGATTT TTGGGAATCT CATTGTAACA ACGCCGCATG TTAAAGAAAT CATAGAAGGG
TCACTGAATA CTCTTCTTAC TATACTCAAA GACCAGGGAA TAAATATATC ACAACTTAAT
GTAAGCTTGG GAGGAAATTT TACTGGTCAG CAGAATCAAG AACAGCAGAG GTTTTCTCAA
AGAAAAAATT TGATTGTTCA AGGTAATGAG GAGAGTATCA GAAGTATAGA GAGTTTGATT
TATGAGATAA ATGAAAGTAT TCTTAACTTG AAAGCTTGA
 
Protein sequence
MGTQNIVNTN MLFLKTFSAT SKTKDRQENS ISFKDVFKKA SDIAKDTDNK SNSLQKNAEE 
SYRAAIVTSK RQFQNENQNL HVDTPFSKEL PDGTNSDLEN RAEIQSLQAQ VMEFLQLIFN
LIQSGESLDK FNLDKLFQNS SIQETNFLNS QLQSLKMDVD LDQYLNLNIN QSNKTTITKI
LHNILQKMVK DTQNQQVQNF IYVQGSEVNR ENIFELLKEV LLEKEGEKQI FDVKSDDSSW
VKSFINLFAQ EGQSFKVLSE ASGGKEILKN VLNELENIAK RLNVQKIVDS FNMESPERVQ
IAEKGNSNFN GIGSNDGDFN KVFSTLLKKD EGSSINDQKG EVKTLDLRQH VFAFQNKVEN
IENTTPSQND RIIKDLRMSI INQLAEKISV VSRQNLTTLQ VSIKPEWLGS VVIELSKDSS
GKIFGNLIVT TPHVKEIIEG SLNTLLTILK DQGINISQLN VSLGGNFTGQ QNQEQQRFSQ
RKNLIVQGNE ESIRSIESLI YEINESILNL KA