Gene Moth_1553 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_1553 
Symbol 
ID3832186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp1595727 
End bp1596746 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content61% 
IMG OID637829485 
Producttwitching motility protein 
Protein accessionYP_430405 
Protein GI83590396 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2805] Tfp pilus assembly protein, pilus retraction ATPase PilT 
TIGRFAM ID[TIGR01420] pilus retraction protein PilT 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0974985 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGATA TCGAAACAAT CCTGGTAGCG GCTGCCGCCG CCGGTGCCTC CGACGTCCAT 
ATCTCTGTTG GGCTACCGCC GGTTTTCCGG GTGCACGGCG AACTCCAGGT CCAGCGCCAG
TGGGACCCCC TGGATTTGGA GATGACAGCC CGCCTGGTAC GGCCTATAGT GGGCGATAAG
TGGGAGGTAT TTCAGGAGCA AGGTGAAATC GATCTGGCTT ACTCCCTGCC CGGTGTAAGC
CGTTTCCGGG TCAATGTCTT TCACCAGCGG GGGAGCGTCG GGGCGGCCAT CCGCCTTATC
CCCAGGGAGA TACCCAGCCT GGAGTCCCTG GGTTTACCGC CGGTGGTGGC CGAACTGGCG
GGCAGACAGC ACGGCCTGAT CCTGGTGACG GGGCCGACGG GCAGCGGTAA ATCCACCACC
CTGGCGGCCA TGGTGGATAA AATCAACCGG GAGCGGAGCT GTCACATCAT CACCCTGGAA
GACCCCATTG AGTACTTGCA CCAGCACCGC CGCAGCATAG TCAACCAGCG GGAAGTGGGT
TCTGATACCA GGTCCTTTGC CAGCGCCCTG CGGGCCGCCC TGCGTCAGGA CCCCGACGTT
ATCCTGGTCG GGGAGATGCG CGACCTGGAA ACCATCGCCA CGGCCATTAC GGCTGCCGAA
ACAGGTCACC TGGTCCTGGC GACTTTGCAC ACCAGCAGTG CCGTCCAGAG TGTGGATCGG
ATCATTGACG TTTTCCCGCC CCACCAGCAG GGGCAGGTAC GGATCCAGCT TGCCGACACC
CTGGAAGGGG TGATCACCCA GCAGCTTTTA CCGCGGGCCG ACAGGAAGGG CCGGGTGGCG
GCTGTAGAAG TGTTGATAGC CACCCCGGCA GTGAAGAATC TCATCCGCGA GGGTAAAACC
CATCAAATCG TTTCCACTAT GCAGACCGGA GCCCGCTACG GGATGCAGAC CATGGAGATG
GCGCTGCGGC AACTGATCAC CCGGGGAGTA ATTTTTGAGG AAGGTTTAAA TATTGGTTAA
 
Protein sequence
MLDIETILVA AAAAGASDVH ISVGLPPVFR VHGELQVQRQ WDPLDLEMTA RLVRPIVGDK 
WEVFQEQGEI DLAYSLPGVS RFRVNVFHQR GSVGAAIRLI PREIPSLESL GLPPVVAELA
GRQHGLILVT GPTGSGKSTT LAAMVDKINR ERSCHIITLE DPIEYLHQHR RSIVNQREVG
SDTRSFASAL RAALRQDPDV ILVGEMRDLE TIATAITAAE TGHLVLATLH TSSAVQSVDR
IIDVFPPHQQ GQVRIQLADT LEGVITQQLL PRADRKGRVA AVEVLIATPA VKNLIREGKT
HQIVSTMQTG ARYGMQTMEM ALRQLITRGV IFEEGLNIG