Gene Acid345_3602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3602 
Symbol 
ID4072824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4262219 
End bp4263889 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content61% 
IMG OID637985625 
ProductTPR repeat-containing protein 
Protein accessionYP_592677 
Protein GI94970629 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0222501 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGCGG GCTTCGTCGC CCTGCTGTTT GCCCCGACCT TGCGGTACGG CTTCGTCTAC 
GACGACCGCG TGGTGATTCT CGAGAACCCG TACATCCAGT CGTGGCGATG GTTCCTGCAC
GATTTCACCT CCCACGTTTG GGCCCAAGTC TCGAGCCAGC CCGCCAGTTA CTATCGTCCG
GTTTTCATAA CCTGGCTGCG CCTGAACCAC ACCCTCTTCG GTTTCCAGCC GTGGGGATGG
CACCTGACCA CGGTACTGGC GCACGTGGTG GCAACGCTGC TCGTCTATCG CTTGGCGTTG
CGGCTTTTGC GAAGCCCCTG GCAGGCGGCG ATTGCGGCTT TAATCTTCGC GGTTCATCCA
GTCCACGTGG AAAACGTCGC ATGGGTCTGC GGATCGGTGG ATCCGCTCAT GTCGATTTTC
TATCTCGGCG CGATATTGAG CTATTTGCGT TGGCGCGAAC AGCATTCGCC GGTTGCGCTC
GCCGTCTCGG TGCTGCTGGC GCTGACAGCT ACGCTTACGA AGGAGATCGC GATCACACTG
CCCGCGGTGA TCTTCGCATA CGCAGCGCTC TTTAGCCCCG CCGAAGCTGG TTGGGGCAAG
CGGTTTCTCG CGGCTGCGAG AGATACCGTG CCCTTCGTGC TTGTCGCTGC CGCCTACATG
GCCGCGCGCA GTGTGGTGTT GCATGGGCCA ATATTCGCGG AGATATCGCT GGCAACCGTG
CTGCTCACAC TGCCGGGACT TCTGCTCTTC TATGCGCGGC TGACGGTCTG GCCGGTGAAT
TTGAGTCTCT TCTACAACCG CGCGCCGGTG CAGAGTTTCA GCGCGCAGCG CGTACTGCTT
CCGCTGCTGG TTCTGGCGCT GATCGCGGCG GGTCTGTTCA TGTGGCTGCG GAGAAGTAAA
CAGCGACGTG AAGGCCTGTT CGCGCTTGTT CTGGCGCTGC TGGCGCTGTC GCCGCCTCTG
TATATCCGCT TGTTCAATCC GGACGACTTC GTGCACGACC GCTATTTGTA CCTGTCGATG
GCGGGTGTCG CGATGCTTGC GGCGATGGCT ATAACTTCAA TCAGGGGCGT CAAAGATGGA
AAGTCGTTCG CGCCGCTCCC GCAGGTGCTG GTAGTAGCGG CCATCACGCT TGCCCTGAGC
CTCGGAACGA CAATGACGAA CGGCAACTGG CGCGATGATC TTTCGCTCTG GGGGCATTGC
TTCAAGGTAG CTCCGCACAA CGTGCGCGTA CTGAACAACC TCGCCTCGTC GCTCGGGGAG
TCGGGCGCCT ACCAGGTCGC GGTGCCAATG TTCCTCGAAG TACTGAAACG CGATCCGAGC
AATGCTCGCG CTAACGCGAA CCTTGGCTAT ACGCTCTACC AGGCAGGCGC GCTGGAGCAG
GCGGAGAAGT ATCTCTCGAA AGCAGTGCTG TTGAACGCCT CCGATGCCCA TTCCTGGCTA
TATCTTGGCG TGACTCACCT CAAGCTAGGC GCGACTGCGG AAGCGGAGTC GGACCTGCGC
CAGGCGATAA CGATAGACCC CTCGGCCACC GGCGCGCACT TGGCTTTATC AGTGGTGCTC
GAACAGCGCG GTGACCGGGC GGGCGCCATC GCCGAATCGC AAGAAGAGCT GCGCTATCAC
CCCGAGGAAC AATCGGTGCA GCAGCGCCTC CAACAACTGC AGGCGAAATA A
 
Protein sequence
MLAGFVALLF APTLRYGFVY DDRVVILENP YIQSWRWFLH DFTSHVWAQV SSQPASYYRP 
VFITWLRLNH TLFGFQPWGW HLTTVLAHVV ATLLVYRLAL RLLRSPWQAA IAALIFAVHP
VHVENVAWVC GSVDPLMSIF YLGAILSYLR WREQHSPVAL AVSVLLALTA TLTKEIAITL
PAVIFAYAAL FSPAEAGWGK RFLAAARDTV PFVLVAAAYM AARSVVLHGP IFAEISLATV
LLTLPGLLLF YARLTVWPVN LSLFYNRAPV QSFSAQRVLL PLLVLALIAA GLFMWLRRSK
QRREGLFALV LALLALSPPL YIRLFNPDDF VHDRYLYLSM AGVAMLAAMA ITSIRGVKDG
KSFAPLPQVL VVAAITLALS LGTTMTNGNW RDDLSLWGHC FKVAPHNVRV LNNLASSLGE
SGAYQVAVPM FLEVLKRDPS NARANANLGY TLYQAGALEQ AEKYLSKAVL LNASDAHSWL
YLGVTHLKLG ATAEAESDLR QAITIDPSAT GAHLALSVVL EQRGDRAGAI AESQEELRYH
PEEQSVQQRL QQLQAK