Gene Athe_1883 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1883 
Symbol 
ID7408996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1990342 
End bp1991397 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content40% 
IMG OID643716255 
Producttwitching motility protein 
Protein accessionYP_002573744 
Protein GI222529862 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2805] Tfp pilus assembly protein, pilus retraction ATPase PilT 
TIGRFAM ID[TIGR01420] pilus retraction protein PilT 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000175789 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATATTA ATTCAATTCT CAAAGAAGCA TTTCTCAAGG AGGCTTCAGA TATACATATT 
ACACCAGGTG TTCCTCCAAT TTATCGAATA CACGGCAGAT TAGTACGGAC AGATGATTCA
ATTCTGACCC CTGAGATGGT GGAGGAGTTC GTGCGACAGA TCACCAACGA AAACCAGTTT
AAAATTCTTG AGCAGAAAGG TGAGATAGAC TTTTCTTACG GCATAAAAGG TGTAAGCAGA
TTTAGAGTAA ATGTTTATAA ACAAAGAGGG TCATATTCTA TAGCTTTTAG AATAATTCCA
GTAAATATAC CACCATTTGA GACACTTGGT CTTCCACCAG TTTTGAAAGA ATTTACAAAA
TTGAATAAAG GGCTTGTTTT AGTTACAGGT CCAACCGGTT CGGGTAAGTC AACAACACTA
GCATCGCTGA TTGACATAAT CAACAAAGAA AGAGATGTGC ATATAATCAC ATTAGAGGAT
CCAATAGAGT ATTTGCATAG ACACAACAAG AGTATTATCA ATCAAAGAGA GATAGGTAGT
GACACGCTCA GCTTTGCAGA CGCGCTGAGG GCGGCTTTGA GAGAAGATCC TGACGTAATC
CTTGTTGGTG AGATGAGAGA TTTGGAGACA ATTGCAATAG CTTTAACAGC TGCTGAAACA
GGACACCTAG TGTTTTCCAC ACTTCACACA ATTGGAGCGG CAAAGACAAT AGACCGTATC
ATTGACGTTT TTCCACCGTA TCAACAGCAG CAGATAAGAA TTCAGCTATC AACTGTTTTG
CAGGGAGTAG TGTCTCAGCA GCTTTTGACC CGTCGTGATG GTAAAGGCAG GGTTGTTGCA
ACAGAGGTAA TGATAGTAAA TCCGGCCATA AGAAACCTCA TTAGGGAGGC TAAAACGTAT
CAGATTCAAT CAATTATTCA GACGCATCAG CGACAGGGCA TGATAACAAT GGAGCAGTCA
CTCATAGACT TGTACAAACG AGGGTTTATT ACCCGTGAAG ATGCGTTCAA CTATGCTACT
GACTTTGATT TTATGCAAAG ACTGCTCAGT GCATAA
 
Protein sequence
MDINSILKEA FLKEASDIHI TPGVPPIYRI HGRLVRTDDS ILTPEMVEEF VRQITNENQF 
KILEQKGEID FSYGIKGVSR FRVNVYKQRG SYSIAFRIIP VNIPPFETLG LPPVLKEFTK
LNKGLVLVTG PTGSGKSTTL ASLIDIINKE RDVHIITLED PIEYLHRHNK SIINQREIGS
DTLSFADALR AALREDPDVI LVGEMRDLET IAIALTAAET GHLVFSTLHT IGAAKTIDRI
IDVFPPYQQQ QIRIQLSTVL QGVVSQQLLT RRDGKGRVVA TEVMIVNPAI RNLIREAKTY
QIQSIIQTHQ RQGMITMEQS LIDLYKRGFI TREDAFNYAT DFDFMQRLLS A