Gene Athe_1664 details

Gene Information

Locus tag: Athe_1664
Symbol:
ID: 7409494
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1758223
End bp: 1759485
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 35%
IMG OID: 643716033
Product: flagellin domain protein
Protein accession: YP_002573531
Protein GI: 222529649
COG category: [N] Cell motility
COG ID: [COG1344] Flagellin and related hook-associated proteins
TIGRFAM ID:
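
The length fields in this record are consistent with the coordinates if the start and end positions are 1-based and inclusive, and if the last codon of the CDS is a stop codon. Both are assumptions, since the record does not state its coordinate convention. A minimal sketch of that arithmetic, using hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1758223, 1759485)
print(length_bp, protein_length(length_bp))  # prints: 1263 420
```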


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTATTA ATAATAACAT TCAAGCATTA AACACTTATA ACAGGTTAAC AATCAACAGT 
AATAATTTAT CAAAGTCTTT AGAGAAGCTT TCATCTGGTA TGAGAATCAA CAGAGCTGGC
GATGATGCAG CAGGTTTGGC TATTTCAGAG AAGATGAGAG CGCAGATTAG AGGTTTGGAT
CAAGCTACTC GAAATGCTCA AGATGCTATT TCGTTGATCC AGACAGCAGA AGGTGGATTA
AATGAGGTAC ATTCAATTTT ACAAAGAATG AGAGAATTAG CAGTTCAAGC AGCAAATGAT
ACAAATGTTG AACAAGACAG AAGTGCTATT AGTGATGAAA TTAACCAATT GATTCAAGAA
CTTGATAGAA TTTCTTCAAC TACCGAATTT AACACACAAA AACTTTTGGA TGGTACATTT
AGTGGAAAAT TTCAGATTGG CGCAAATGAA GGACAAACAT TGCAGCTCAA CATTAGTAAG
ATTAATTCAG CTTCTTTAGG ATTAGCATCA TCGATTGAGG TAGAAACTGT AAATAATGCA
AATGCGGGTA ACTTGATAAA AGATGGTGTA TATACAGTAG ATAGTTCAAA CAACTTGATA
GATGCAGCTG GAAAGATAGT AGGTACAGTG AGTGGGATGA CGATTACTTT AGCTGATGGA
ACCACAACAG TTAGTTTTAC AAGTGAAAAT ATTACAACAG GTGCTATTAT TAGAGTATCA
GACAATGGTA ATACTTTTAC TATGGAAAAG ACAGTAGTAA GTGGTCAGAC AAACGATAAG
TTAGCAGCTG GGACATATAC AATAAGTGGT TCAGATGTGT TGAAAGACAA TGTAAAAATA
GGTACGGTAG ACACTACAAC TAAAACAAAA ATTAATTTGT TAGATGGCAC GTCAATTGAT
TTAGCAATAG TATTTGGTAA GACGGCTAAT TTGGCTGATG GGGATACGTT TGTAATTAAA
GGTGTTAATG TATCAAACAA TTTCCTTGCA AGTGGAAGTG TAACGGCTAT AGATAAAGCG
TTAGAACTTG TATCAAAAGA AAGAGCAAAA CTGGGTGCTT ATCAGAATAG ACTTGAACAT
ACTATTACAA ACCTTAGCAC ATCTAGTGAA AACTTGACAG CTGCAGAGAG CCGAATCAGA
GACGTTGACA TGGCAAAAGA GATGATGAAT TATACAAAGA ACAATATTCT CATGCAGGCT
GCAACAGCAA TGCTTGCACA AGCTAATCAG CTTCCGCAAG CAGTATTACA ATTATTAAGA
TAA
 
Protein sequence
MRINNNIQAL NTYNRLTINS NNLSKSLEKL SSGMRINRAG DDAAGLAISE KMRAQIRGLD 
QATRNAQDAI SLIQTAEGGL NEVHSILQRM RELAVQAAND TNVEQDRSAI SDEINQLIQE
LDRISSTTEF NTQKLLDGTF SGKFQIGANE GQTLQLNISK INSASLGLAS SIEVETVNNA
NAGNLIKDGV YTVDSSNNLI DAAGKIVGTV SGMTITLADG TTTVSFTSEN ITTGAIIRVS
DNGNTFTMEK TVVSGQTNDK LAAGTYTISG SDVLKDNVKI GTVDTTTKTK INLLDGTSID
LAIVFGKTAN LADGDTFVIK GVNVSNNFLA SGSVTAIDKA LELVSKERAK LGAYQNRLEH
TITNLSTSSE NLTAAESRIR DVDMAKEMMN YTKNNILMQA ATAMLAQANQ LPQAVLQLLR
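
Both sequences are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that, then compute GC content and translate the gene. The function names are hypothetical. The codon table uses the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ in which start codons are permitted, not in how codons are read).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, listed in TCAG order with the third base
# varying fastest (the layout used for NCBI genetic code tables).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(blocks: str) -> str:
    """Join the space- and newline-separated blocks into one sequence."""
    return "".join(blocks.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    return 100.0 * sum(dna.count(base) for base in "GC") / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGCGTATTA ATAATAACAT TCAAGCATTA AACACTTATA ACAGGTTAAC AATCAACAGT"
print(translate(clean(first_line)))  # prints: MRINNNIQALNTYNRLTINS
```

Passing the full gene sequence through `clean` and `translate` should give a result that can be compared against the protein sequence listed above, and `gc_percent` can be checked against the GC content field.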