Gene Acid345_4608 details


Gene Information

Locus tag: Acid345_4608
Symbol:
ID: 4070765
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5457809
End bp: 5462023
Gene Length: 4215 bp
Protein Length: 1404 aa
Translation table: 11
GC content: 61%
IMG OID: 637986648
Product: TPR repeat-containing protein
Protein accession: YP_593682
Protein GI: 94971634
COG category: [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
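
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(5457809, 5462023)
print(length_bp)                  # 4215
print(protein_length(length_bp))  # 1404
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of this record.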


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.406661
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAAAAAA GTTTTATTTA CTGTTCCCTG TTCCTCTCAT TCGTCGTATC TGCGGCAGTT 
GCGCAGGACT TCGAGATCAA CGGCGGCCAG CAGCAGCAAA CACCGGCGCC GCAGAAAGCC
GGTAAAAAAT CCAAGGGCAA AGCTGCGTCT TCCGCTCCCG CCAACGGCGG CAGCAACGAA
ATCGGTTGGG GGAATAGCAT CGAGGTCGGC CGTCTGGCTC GCGCCGCGCA GGATGCCCTG
AAGCACGGCA ATCCTGCCGC GGCCGCCACG TATGCTCAGC GCGCCGTCCA GGCTGCTCCG
CAGAACAGCA AGCTGTGGCT GCTCCTTGGC TACACCTCGC GCCTCGCCGG ACGCAACCAG
GAATCGATCA ACGCCTACAA CCACGCGATC CAGACCGATC CCAAGTCTCT CGACGCGAAA
TCAGGCCTCG CCCAGACCTA TATGAAGATG GGCCGCACCG ACGAAGCCAA GCGCTTGCTC
GCGCAAGTGC TCGCCGCCGG TTCCACCCGC CAGAATGACC TCCTCGTCGC CGGCGAACTC
TATCTCAGGA CCAAGGACTA CCAGCAGGGC ATCAACTACC TGCAACGCGC TGACAACCTG
AAGCCCAATG CGCACGCCGA GCTGCTGATG GCGATGGGCT ACATGAAGAT GAAGCAGCCG
CAGAAGGCCA AGCAGCTTCT CGACATGGCC AAGCGCCGCG CGCCGAACAA TGTCGAGATC
TTCCAGGCGG TCGCCACTTT CTATCGCGAA GACCACGACT ACAAGAACGC CATCGCGACG
TTGAACAGCG CGCCGCGCAA GACGCCCGCA TTGCTCGCCG ACCTTGGCTT CACCTACGAA
CTCTCTGGTG ATAAGCAGAG CGCCGCCGCT ACTTACGTGA AGGCGGCCAA CGCCGCGCCG
AAAGAAATCA AGTTCCAGCT CAGCGCCGCG CAATCGCTCA TCCAGAGCGG CGATAAGACG
AAGGCCCAGG ATTTCCTGAA GCGCGCCGCC GCCATCGATC CGAATCATTA CCGCTTGCAC
GCCATCCGCG CCGGACTCGC GAAGTCCGAG AACCGCAACG ACGAAGCGGT GAAGGAATAT
CAGCTCGCCC TTTCCGCGAT GCCGAAGGAA GGCGTCCCCG AAGGCCAGCT CTATCCGGTG
CTTCTCCGCT TAAATTTGTC TGAAGCGCTG AAAGATACCG GCAACACCGA AGCCGCCAAA
CAGCAAGTTG AAATTGCCGA GCAGGAAATT TCCAAGATTA ACGTGGAAGG CCCGGCAAAG
GCTGAGTTCC TCCGCGTGCG CGCGTCGATC AAGGCCAGTG GCGAAGACTA CGCCGGTGCC
GAAGCTGATC TGAAGGAAGC GCAGAAGCTC GATCCCGACA ACCAGCTCAT CACCCTCCAG
TACGCGAACC TGCTCTGGAA GGCCGGGCGC AAAGACGAGT CGAAGCAAAT GTACCTCGGC
ATTCTTCAGG GCGATCCGAA AAATCGTTAC GCCCTCGAGG CCCTCGGTTA TCTCGCGCGT
GACGTGGGCG ACACCGCCGG CGCCGAGCAT TACTTCACCG CGCTCGCGCA GGCTTATCCC
GACGACTACA TCGCATACCT GGCGCTCGGC GATCTCTACA CCGCCACTCG CGACTTCACC
CGCGCTCTCG CCGCCTACGA CAAAGCCCAC GAGCTAGCGC CCAATAACGC AATCGTCATC
GCCAACGCCG CCAATGCCGC CATCGAAGCT CGCCAGTTCC CGCTCGCCGG ACGCTGGATC
GCGATGGCCA CGGGCGAGAT GGCTGACGAG CCACACGTGC TCGTCGAAAA AGAACGCTAT
CTCTTCCACA GCGGCAAGTA TCAGGAAGCC GCGGTCGCCG GCCAGCGCGC GCTGGAAAAG
CTGCCGAAAG ACCGCAACGC CAGCGTCTAT CTTGCCTACA CGTATTACAA CCTCGGCCGT
TATGACGACG TCCTTGCCCT CAGCGACAAG TACGACAACA TAATTCCGAA GGAACCGAAC
TTCCCGCTCC TCGAAGGCCA CGTGCATCGC CAGTCGCAGC TGACGGACGA AGCCGTACAG
GACTACACCC GTGCTCTCGA ACGCGATCCC AAGATGGTGG AAGCCTACGT CAACCGCGGC
TATGCATTAA ACGATCTGCA GAACGCCGAG CAGGCCGCGC AGGACTTCCA CGCCGCGCTC
CAGCTCAATC CGAACAACGG CACCGCGCAC CTCGGCCTGT CGTTCTCCGA GCTGCAACTG
CATCACGGCA AAGAAGCCCT GGCGGAAGCT GCCGCCGCCG AAAAGATTAT GGGCGAGTCC
GGCGCGACGC ACCTCGCCAC TGCCACCGCG TATCGCCAAC AGCGTCTGAT GGCGCAGGCC
GAGAAGGAAT ACATTGCGGC CATCAAGTTC GCGCCGCAGG ATCCCAAGCT CCACCTGGCG
CTGGCGGAAA CGCAGTACAA CGAGCGCAAG TATCAGCAGT CGCTCAATAC GCTGGCCGAC
GCTCTCACCC TCGATCCCAA CGATCCGCTG ATCTACGCCC AGATGGCGCA TGCTCACGCG
GAGCTTAAGC ATCGTGACGA GACGCTTCGT TACGTAACGC TAGCCGAGCA AACCGGCGGC
GAGAAATCCG CCATCCTGCT CGACACCGGA GAAGCCCTGC TCACCCTCGG CGATCGTGAT
GGCGCGATGA AGCGCTTCGA ACGCGCCCTC GATGCCCCCG ACGCCGACCG CGTCCAGGCC
CGCCTCGCCA TCGCGCGCCT CATGGTCCAC GACGGCAAAA ACGGCGACGC TCGCCAGCAG
GTCGCGCTCG CTTTTGCAGA AAGCCGTATT GGCGAAGCTT CGCCGGTCAC CCCCGACGAT
CTCGTCGAAG CCGCCAACAT TTTCCTCTCG ATCAACGATT TCGATCTCGC ACAGCGCTAC
TTCGTAAAAG CGAAGGACGC CGGCGCTGCC GATGAAGTCG TCGCCATCGG CCTCGCCAAT
GTTGCGCTCG TTCGCGGCAA CACCAACGAA GCCCAGGTCC AGCTCGCTTC CGTCGGAGAT
CCGGCGGAAC AGGCACAGAA CTACGATTAC CAGCTCGCTC TCGGCGATAT GTATCGCCAG
CAGCGCAACG GACAACTGGC CCTCAGCGCC TTCGCGCGCG CTAACCAACT GAGCGGTGCG
GACAACACGA CCGCCGAACG CGCTCTCTTT GAAACCGCCG GTCAGGAGGG CATGCACATC
AACGACAAGT TGAGCCTGCT CTCCGACCTC GACGTCCACG GCATCTTCGA TAACGCGACC
ATCTACAACC TCGATCGCCA GATTTTCGGC GTCGTCGGGA GCCAGCAGCC GATTCCGCCG
CGCTCCACCA CCGAGTCACT CTGGACCAAT GGCTACCGCC TGCACCTCAA TGGGCTCCCG
TTGATCAGTG GTTTCTTCCA GCTTCGTAAT GCTCGCGGTG AGTTCTCCGT GCCCAGCAAC
GCGCTGATCG TTCCCACTGA TACCTACGAT TACAACTTCA ACACCGCAAT CAATCCCACG
CTGAAGATGG GCTTCCTCCG TCTCGACTTC AACGCCGGCG TCCAGTTCAC GGTGCGCCGC
GACAAAGATT CGCCGGTCGT GATGAACCAG AACCTCTTCC GCCAGTTCGT CTACCTGCAG
TCGAATTCCA TCGGCGACTG GCTCCAGATT CGCGGCGAGG CCTATCACGA AGCCGGGCCC
TTCACCGACC GCGACTTGAG CTCGCGCGAC GTCGGCACCC GCCTCGAGTT CGTCGTCGGC
CGGCCATGGG GACGCAATGC CCTCCTCACC GGCTACAGCG CCCGCGACAT TCAGTACAAC
CCGAGCATTC GCGAATTCTT TACCACCAGC ACATATGCCG GTTTGCAACA CACTTTCGGA
CGCGAACGCA GTTTGACTGT CGCCGCGCTC GGCGAATACA TCCGCTCCTG GCGCGTGCAG
GACGACTTCT ACGCCATCGC GCAGGCGATC GAACCCGGCG GCCGAATTAC GTGGCAACCA
AATAATCGCT GGAAGCTAGA TGGAAATTTC GCGTGGGGTA AAGGCGAAGG TTTCGCCTTA
TATGACAACG TGCAGAGCAG TTTCTTCATC TCTTATGTAA AGCCGTTCCG CCGCTCGATG
ACGGATGCGT TTGGGGATGT ACCTGTTGAG TATCCGCTCC GTTTCTCCCT CGGGGTTCAG
ACGGACAACT TTTATCACTT TACTGGCCGC GGACAGACTC AGATTCGACC TGTAATCCGT
CTGACGCTGT TTTAA
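
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch, with hypothetical helper names, of how the blocks could be joined and a GC fraction computed. Pasting the full sequence into `gc_fraction` should give a value comparable to the GC content field of this record, though that result is not shown here.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "GTGAAAAAAA GTTTTATTTA CTGTTCCCTG TTCCTCTCAT TCGTCGTATC TGCGGCAGTT"
print(len(clean_sequence(first_line)))  # 60
```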
 
Protein sequence
MKKSFIYCSL FLSFVVSAAV AQDFEINGGQ QQQTPAPQKA GKKSKGKAAS SAPANGGSNE 
IGWGNSIEVG RLARAAQDAL KHGNPAAAAT YAQRAVQAAP QNSKLWLLLG YTSRLAGRNQ
ESINAYNHAI QTDPKSLDAK SGLAQTYMKM GRTDEAKRLL AQVLAAGSTR QNDLLVAGEL
YLRTKDYQQG INYLQRADNL KPNAHAELLM AMGYMKMKQP QKAKQLLDMA KRRAPNNVEI
FQAVATFYRE DHDYKNAIAT LNSAPRKTPA LLADLGFTYE LSGDKQSAAA TYVKAANAAP
KEIKFQLSAA QSLIQSGDKT KAQDFLKRAA AIDPNHYRLH AIRAGLAKSE NRNDEAVKEY
QLALSAMPKE GVPEGQLYPV LLRLNLSEAL KDTGNTEAAK QQVEIAEQEI SKINVEGPAK
AEFLRVRASI KASGEDYAGA EADLKEAQKL DPDNQLITLQ YANLLWKAGR KDESKQMYLG
ILQGDPKNRY ALEALGYLAR DVGDTAGAEH YFTALAQAYP DDYIAYLALG DLYTATRDFT
RALAAYDKAH ELAPNNAIVI ANAANAAIEA RQFPLAGRWI AMATGEMADE PHVLVEKERY
LFHSGKYQEA AVAGQRALEK LPKDRNASVY LAYTYYNLGR YDDVLALSDK YDNIIPKEPN
FPLLEGHVHR QSQLTDEAVQ DYTRALERDP KMVEAYVNRG YALNDLQNAE QAAQDFHAAL
QLNPNNGTAH LGLSFSELQL HHGKEALAEA AAAEKIMGES GATHLATATA YRQQRLMAQA
EKEYIAAIKF APQDPKLHLA LAETQYNERK YQQSLNTLAD ALTLDPNDPL IYAQMAHAHA
ELKHRDETLR YVTLAEQTGG EKSAILLDTG EALLTLGDRD GAMKRFERAL DAPDADRVQA
RLAIARLMVH DGKNGDARQQ VALAFAESRI GEASPVTPDD LVEAANIFLS INDFDLAQRY
FVKAKDAGAA DEVVAIGLAN VALVRGNTNE AQVQLASVGD PAEQAQNYDY QLALGDMYRQ
QRNGQLALSA FARANQLSGA DNTTAERALF ETAGQEGMHI NDKLSLLSDL DVHGIFDNAT
IYNLDRQIFG VVGSQQPIPP RSTTESLWTN GYRLHLNGLP LISGFFQLRN ARGEFSVPSN
ALIVPTDTYD YNFNTAINPT LKMGFLRLDF NAGVQFTVRR DKDSPVVMNQ NLFRQFVYLQ
SNSIGDWLQI RGEAYHEAGP FTDRDLSSRD VGTRLEFVVG RPWGRNALLT GYSARDIQYN
PSIREFFTTS TYAGLQHTFG RERSLTVAAL GEYIRSWRVQ DDFYAIAQAI EPGGRITWQP
NNRWKLDGNF AWGKGEGFAL YDNVQSSFFI SYVKPFRRSM TDAFGDVPVE YPLRFSLGVQ
TDNFYHFTGR GQTQIRPVIR LTLF
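
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The gene starts with GTG, which table 11 allows as an initiation codon and which is read as methionine at the start of a CDS. The sketch below is an illustration with hypothetical names, not the tool used to generate this record. It assumes the amino acid assignments of the standard code, which table 11 shares, and for simplicity treats only ATG, GTG, and TTG as start codons.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (standard code; same in table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a subset of the initiators permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(blocked: str, first_is_start: bool = True) -> str:
    """Translate a block-formatted, in-frame DNA sequence."""
    seq = "".join(blocked.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    # An initiator codon is read as Met regardless of its usual meaning.
    if first_is_start and codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    # Drop a single terminal stop codon if present.
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAAAAAAA GTTTTATTTA CTGTTCCCTG"))  # MKKSFIYCSL
# Last 15 bases (in frame), not treated as a start.
print(translate_cds("CTGACGCTGT TTTAA", first_is_start=False))  # LTLF
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed in this record.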