Gene Acid345_1391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1391 
Symbol 
ID4068926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1688187 
End bp1689314 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content59% 
IMG OID637983400 
Producttwitching motility protein 
Protein accessionYP_590467 
Protein GI94968419 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2805] Tfp pilus assembly protein, pilus retraction ATPase PilT 
TIGRFAM ID[TIGR01420] pilus retraction protein PilT 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.899661 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0234159 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGTTA CTCTTAGCGA TCTGTTGAAA AAGATGTTGG AGATGCAGGG CTCTGACCTG 
CACATCACCA CGAACTCGCC GCCGCAAGTG CGGGTTCACG GCAAACTGGT TCCCCTCGAC
CTGGCGCCGC TAACTCCTGC CGAAACGAAG CAGCTGGCCT ATAGCGTCAT GACTGACGCC
CAGAAGCACC GTTTCGAAGA GGACCTCGAG CTAGATTTCT CGTTCGGATT GAAGGGACTC
GCCCGTTTCC GCGCCAACTG CTTCAACCAG CGCGGCGCGT GCGGCTCCGT TTACCGCGTC
ATTCCATTCG AGATCAAGAA CTTCGACCAG CTCGGACTGC CCGCAGTCGT TTCCAAGCTC
TGCGATCGTC CGCGCGGCCT GATCCTCATC ACTGGCCCGA CCGGTTCCGG TAAGTCCACC
ACGCTCGCGG CCATGATCGA CAAGATCAAT ATTGACCGTC ACGAGCACAT CATCACCATC
GAAGATCCGA TCGAGTTCGT GCACCAGCAC AAGAACTGCC TGATCAACCA GCGCGAAGTC
CACTCCGATA CCAAGGGCTT CTCGCAGGCG CTCCGCGCCG CTCTCCGTGA AGACCCCGAC
GTGGTCCTGA TCGGCGAAAT GCGCGATTTG GAGACGATTG AATCCGCGTT ACGTATTGCA
GAAACCGGCC ACTTGACGCT GGCTACCCTG CATACCAACT CGGCAAGCTC CACCATCAAC
CGTATTATTG ACGTCTTCCC TTCGCACCAG CAGTCGCAGA TTCGCGCGCA GCTCTCGCTG
GTGCTGGAAG GCATCATGTG CCAATCGTTG TTGCCGAAGG TCGGCGGTAA CGGTCGCGCC
ATGGCCATGG AGATCCTGGT TCCGAACGCC GCTGTCCGCA ACCTCATCCG CGAAGACAAG
ATCCACCAGA TCTATTCGTC GATGCAGACC GGCCAGGACA AGTTCGGCAT GCAGACCTTC
AACCAGGCGC TGGCAACGCT GGTCGCCCAG AAACAGATCA CGATGGAACT CGCCGTGCAG
CGCTCGTCGA TGCCGGAAGA GTTGCAGGAC ATGATCGCCC GTGGACACAC CCTGCAAGGT
CGAGGGGGCA CCACAGCCGT TAATGCCGCC GCACCAACGC GGAGATAG
 
Protein sequence
MAVTLSDLLK KMLEMQGSDL HITTNSPPQV RVHGKLVPLD LAPLTPAETK QLAYSVMTDA 
QKHRFEEDLE LDFSFGLKGL ARFRANCFNQ RGACGSVYRV IPFEIKNFDQ LGLPAVVSKL
CDRPRGLILI TGPTGSGKST TLAAMIDKIN IDRHEHIITI EDPIEFVHQH KNCLINQREV
HSDTKGFSQA LRAALREDPD VVLIGEMRDL ETIESALRIA ETGHLTLATL HTNSASSTIN
RIIDVFPSHQ QSQIRAQLSL VLEGIMCQSL LPKVGGNGRA MAMEILVPNA AVRNLIREDK
IHQIYSSMQT GQDKFGMQTF NQALATLVAQ KQITMELAVQ RSSMPEELQD MIARGHTLQG
RGGTTAVNAA APTRR