Gene Acid345_0689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0689 
Symbol 
ID4071334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp847883 
End bp849058 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content58% 
IMG OID637982695 
Producttwitching motility protein 
Protein accessionYP_589768 
Protein GI94967720 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2805] Tfp pilus assembly protein, pilus retraction ATPase PilT 
TIGRFAM ID[TIGR01420] pilus retraction protein PilT 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCCT CTGAACCAGC GCTTTCGTCC GCACCTTCGC CATCGCCTAC GCCGGTGTTT 
ACCACCGATG AGATGCTCAG AACCATGCTG AAAGTCTCTG AAAAGGTCAG CGACCTGATC
TTCTCGCCCG GGCGTGCTCC GCAGGTGGAA CTGAACAGCG CTCTCGTCGC GGTGCCAGGA
TTGCCGACAT TAATGCCCGT GGACACACGG CGAATCGCCG GCGATTTGAT GGGTAATAAC
GAACAAGCGA CAACTTCGTT GAGGGAGAAG GGTTCGGCCG ATCTCTCCTA TAGCCTTGCG
CGTGAATCGC GATTCCGTGT GAACATCTTT TCGCAGCGCG GCAGTTACGC CATCGTGATG
CGCGTCATCC CGCACAGCGT GCCCACATTC GAGCAGTTGA ACCTGCCGCC ACAACTGGCC
GACATTACCA AGCTGATCAA CGGCATCGTC CTGGTCACGG GCCCCACCGG ATCAGGTAAG
AGTTCCACCC TGGCGGCGAT CATCAACAAG GTCAACTTGG AGAAGGCGTG GCACATCGTC
ACCATTGAGG ATCCGATCGA GTTTCTTCAC CCTCATAAGC AGTGCACCAT TCACCAACGA
GAGTTGCATA GCGACACGCC GAGCTTTGCC CTCGCTCTGC GCGCTGCGCT GCGCCAGGCG
CCAAAGGTCA TCCTGGTCGG CGAAATGCGC GATCGTGAAA CCATGGAGAT TGCACTCGAA
GCCGCGGAAA CCGGCCACCT CGTCATGTCA ACTCTCCACA CCACCGACGC CTCCAAAACC
GTGGAGCGCA TCATCGGCAC CTTCCCGATT TCCGACCAGC ACATTATTCG AATCCGCTTA
GCGAAGAGTT TCCGCTACAT CATTTCGCAG CGTCTTATGC CGAAGAAGGA TAAGACCGGA
CGCGTGGCTG CCATCGAGAT TCTCAAGTCC ACCATCCGCA CTCGCGAGTA CGTAGAGAAA
GGCGAGAACG AAGGCAAGAC CTTGCTCGAC GCCATGCGCG ATGGCGACCT CGACGGCATG
CAGTGTTTCG ACGACGTGAT CGAGCGCATG ATCCGCGAAG GTGTGGTCGA CATTGATACT
GGCCTCGGAT ACTCCACCAA CCCCGGCAAC CTGCGCCTCC AGTTGGCGGA CCTGATTGAT
GCTCAGCGCG CCGCGGAATC TGAATTCGAA CCATAA
 
Protein sequence
MSASEPALSS APSPSPTPVF TTDEMLRTML KVSEKVSDLI FSPGRAPQVE LNSALVAVPG 
LPTLMPVDTR RIAGDLMGNN EQATTSLREK GSADLSYSLA RESRFRVNIF SQRGSYAIVM
RVIPHSVPTF EQLNLPPQLA DITKLINGIV LVTGPTGSGK SSTLAAIINK VNLEKAWHIV
TIEDPIEFLH PHKQCTIHQR ELHSDTPSFA LALRAALRQA PKVILVGEMR DRETMEIALE
AAETGHLVMS TLHTTDASKT VERIIGTFPI SDQHIIRIRL AKSFRYIISQ RLMPKKDKTG
RVAAIEILKS TIRTREYVEK GENEGKTLLD AMRDGDLDGM QCFDDVIERM IREGVVDIDT
GLGYSTNPGN LRLQLADLID AQRAAESEFE P