Gene Acid345_3005 details


Gene Information

Locus tag: Acid345_3005
Symbol:
ID: 4071560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3559710
End bp: 3563000
Gene Length: 3291 bp
Protein Length: 1096 aa
Translation table: 11
GC content: 59%
IMG OID: 637985024
Product: TPR repeat-containing protein
Protein accession: YP_592080
Protein GI: 94970032
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
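
The coordinates and lengths listed above are mutually consistent, which the following minimal sketch checks. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3559710
END_BP = 3563000


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3291
print(protein_length(length_bp))  # 1096
```

Both results agree with the listed Gene Length (3291 bp) and Protein Length (1096 aa).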


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000321187
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGCGCATTT CTTCTGTTCG GCTCTTTGGC ATCACGCTTC TTTCGTCAAT ACTCTCGGTC 
GCTTCCTTTG CTCTCGCGCA AGACCAGAAG CCTGCTCCGC CGGCGCCTAC CGGCCCGAAC
GTCACTCCGC CGAAGGTCGA TAGATCCGAC GACGCCGATT ACGCCAAGCA GGCCTATGTT
TACGAGAAGT TTGAAGACAA ATGGCGCTTT GAGACCGACG GCACCGGCCA GGAAACCACC
ACGCTGCGCG TCAAGATTCA AAGCGATGCC GGCGTGAAGG CGTGGGGCCA GCTCGTCTTC
GGCTACAACT CCGACAGCGA CGAGATGCAG ATCGCCTATG CGCGCGTCCG CAAGCCCGAC
GGCAAAGTCA TCGACACACC GCCCGATTCC GTTCGCGATA TGACCAGTTC CGTTGAACGT
GAAGCCCCGG TCTACACTGA CTATCGTGAG AAGCACCTGA CCGTTTCTTC GCTCCAGGTC
GGCGACATCC TTGAGTACCA GCGCATTCGC AAAATCACGA AGCCCGCTAC GCCGAATGAG
TTCTTCACCG AGCACACCTT CATGAAGAAC TACATCATCC TCGACGAACA GATCGAGTTC
GATGTTCCCG TCAAGGCCAA CGCCAAGCTC AAATCTCTCC CGGGCTTCGA ACCTACCTCC
ACGCGTACCG AGGGCGATCG CACCATATAT TCCTGGAAGC GCTCAAACCT GAAGGTCGAG
GACGAAGAAG AGAAGCAGAA GCGCGAAAAG AAAAAGGGGA AGAAGCCGCA GGAATTTGCC
GACGTCCAAC TCACCACCTT CAACAGTTGG GAGCAAATCG GCAAGTGGTA CCAGGACCTC
CAGCGTGATC GCGTTGCGCC CACGCCTGAA ATCAAAGCCA AAGCCGCGGA GCTGACCAAG
GGCCTCACTA CCGACGAAGA CAAGATCGCT GCTCTCTATC GCTACGTCGC CACCGGCTAT
CGCTACGTGA GCCTCTCGCT CGGCGTCGGA CGCTTCCAAC CCCGTGCTGC CTCCGTCACC
ATGCAGGACA AGTACGGCGA CTGCAAAGAC AAGGCGACGC TTCTCTCATC GCTACTGATC
GCATCCGGCT ATAAGCCCGC GAACGTCCTC ATCCACACCT TCGTCAAGCT CCAGGATGAC
TTCCCGACCC CAGCCTCCTT CAACCACGTC ATCACCGAAG TCAAAGCCGG CGATAAAGAA
TTCTGGATGG ACAGCACGAC GGAGGTCGCA CCCTTCCGCC TCCTCACCTG GAACATCCGC
AAAAAGAAAG CCCTGCTCGT TCCCGTGGAT GGCCAGCCGC ACGTCGTGGA AACGCCCGCC
GATCCGCCAT TCACCAGCCT TGAGACCATC AACGTCGCCG GCAAGATCAA CGAGCTCGGC
ACCGCTGACC TCCACCTGCA GATCATCTCG CGCGGCGACA GCGAGCTCCA GCTTCGTAGC
GTCTTCCGCA ACTACGGCCA GGCAAACTAT CAGAAATTGA TGGAGAACAT CTCGCGCGTC
CTCGGCGTCC CCGGCGACGT CAGCGACGTC AAGGTCTCCG ATCCCGCCGA CACCGTGAAG
CCGTTCAGCA TGGAGTGCAA CGTCAAAGTC CAGAACGCGA TCGAGTGGAA AGACAAGACC
GGCACCCTCG GCGTTCCGTT CGGCTCCATG AATCTCTCCG AAGAGCCGTC GGATCCCGGC
CCTGACACCG AACCGCTCCC GCTCGGCGGA TCTCCCGGCG AGTATCGCGT CATCATGAAA
GTTGACCTGC CGGAGAAGTA CACCCTCCGC CTGCCTGCCT CCATGAGCGT CAAGCGCGAT
TACAGCGAGT ACTCCTCGAA CTACACGCAG GACAAGTCCA CATTCGTCGC CGAACGATAC
CTGCACATCA TGCAGCGTGA AGTGCCGATC AAACGTTTCG GTGATTACCA CGCCTACCGC
CTCGCGGTGA ACTCCGATCA CGGGCAAGCG CTCACCCTCA CCCGCACCGA CGCCAGCGTC
GCCGGCGCCG AAAAAGACGC CAAGGCCGAC GACCTTTTCG ACGCCGCGCA GGCCGCCGTT
CGCGCCGAAA ATTATCAGAA CGCCATCGAG CTTCTGCAGC GCGCGCTCGT CCTCGAGCCT
GAGCACAAGT ACGGATGGGA CGCCCTCGCC GAGACCTACT ACAACGCCGG CGACCTCAAC
AAAGCCATCG AGTACTACAA GAAGCAGCTT GAGGTGAATC CCTACGACGA CCTCGCCAAC
ACCGGTCTGG CGCAGGTCTA CATGACGCAG TACAAGTACG ACGACGCTCT CGCGGCCTTC
AAGAAGCAGG CCGAAATCAA TCCGCTCGAC AAGACTGCGC ACCTCGGCAT CGGTCAGGTC
GACATCATTC GTGAAGACTA CAAGGCCGCC GTGCCTGAAC TCGAGCGCGC CGTATCGATC
CTTCCGCAGT CGTCGGTCGC TCGCTACATG CTGGGCAACG CGTATCTCAA CACTGGTCAG
ACCGAGAAGG CGATCACCGC CTTCGAAGAA TCCGTCAAGC TCGACGCCAA CAATCCCATG
ACGTGGAACG ACATCGCCTA CGCGCTCGCC GACAAAGACG TCAAACTCGA CAAGGCCGAG
CAGTACGCGC AGAGTTCCGT CAGTACCACG CAGTCCTACC TGCGCAACCT GCCCGCGGAG
CAGGCCCTCA AAGCTGGCCC GCAAATGACC GCCAGCCTCG CCGCTGCCTG GGACACCCTC
GGCTGGGTCT ACTACAAACA GGGCAAACAG AAAGAAGCCG AAGAATTCAT CCACGCCGCT
TTCGATCAGG ACCCGCACTC CGAAGTCGCG GAGCACCTCG CGATCTTCGC CGAGAAGCGT
AACGACAAGA AGGCAGCCGC CGAGTACTAC GCCATGGCCC TCGCTGGCGA TCGTCCTGCG
CCTCGCTATC GTGAGAAGCT GATCACCCTC GCTTCGATCA AGGATGCCGA CGTCGAGGCG
AAGATCAAAG AAGCCAAGGT CAAGCTCGAC GCTGAACGAT TCCTCAAGCT CAACAACGCT
GGTTGGACCG GTAAAGCCGA ATTCGTACTG ACTTTTACCG CTTCCAAACA AGCGTCAGAT
GCGCAGTGGA AGTCTGGCGC CGACTCGCTC AAGCCCGCGG CCAAAGCGCT TATGGCGATG
AGCTACCCCA TCACTCTGCC GTCAGGGGAG TACCGCATCT TCCGTCGCGT GCTCGTCAGT
TGCGAAGCAG GGAAGGACTG CAGCGTGTTG CTGTATGGCG CCGAAGATCG CGAGAGCACC
GTTGATGTGC CCACCGCAGC GTCAATGAGC GACACCAAAC CCGCGAATTA A
 
Protein sequence
MRISSVRLFG ITLLSSILSV ASFALAQDQK PAPPAPTGPN VTPPKVDRSD DADYAKQAYV 
YEKFEDKWRF ETDGTGQETT TLRVKIQSDA GVKAWGQLVF GYNSDSDEMQ IAYARVRKPD
GKVIDTPPDS VRDMTSSVER EAPVYTDYRE KHLTVSSLQV GDILEYQRIR KITKPATPNE
FFTEHTFMKN YIILDEQIEF DVPVKANAKL KSLPGFEPTS TRTEGDRTIY SWKRSNLKVE
DEEEKQKREK KKGKKPQEFA DVQLTTFNSW EQIGKWYQDL QRDRVAPTPE IKAKAAELTK
GLTTDEDKIA ALYRYVATGY RYVSLSLGVG RFQPRAASVT MQDKYGDCKD KATLLSSLLI
ASGYKPANVL IHTFVKLQDD FPTPASFNHV ITEVKAGDKE FWMDSTTEVA PFRLLTWNIR
KKKALLVPVD GQPHVVETPA DPPFTSLETI NVAGKINELG TADLHLQIIS RGDSELQLRS
VFRNYGQANY QKLMENISRV LGVPGDVSDV KVSDPADTVK PFSMECNVKV QNAIEWKDKT
GTLGVPFGSM NLSEEPSDPG PDTEPLPLGG SPGEYRVIMK VDLPEKYTLR LPASMSVKRD
YSEYSSNYTQ DKSTFVAERY LHIMQREVPI KRFGDYHAYR LAVNSDHGQA LTLTRTDASV
AGAEKDAKAD DLFDAAQAAV RAENYQNAIE LLQRALVLEP EHKYGWDALA ETYYNAGDLN
KAIEYYKKQL EVNPYDDLAN TGLAQVYMTQ YKYDDALAAF KKQAEINPLD KTAHLGIGQV
DIIREDYKAA VPELERAVSI LPQSSVARYM LGNAYLNTGQ TEKAITAFEE SVKLDANNPM
TWNDIAYALA DKDVKLDKAE QYAQSSVSTT QSYLRNLPAE QALKAGPQMT ASLAAAWDTL
GWVYYKQGKQ KEAEEFIHAA FDQDPHSEVA EHLAIFAEKR NDKKAAAEYY AMALAGDRPA
PRYREKLITL ASIKDADVEA KIKEAKVKLD AERFLKLNNA GWTGKAEFVL TFTASKQASD
AQWKSGADSL KPAAKALMAM SYPITLPSGE YRIFRRVLVS CEAGKDCSVL LYGAEDREST
VDVPTAASMS DTKPAN
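
The relationship between the two sequences above can be reproduced with a short script. This is a minimal sketch, not the pipeline used to build this record. It strips the 10-base block spacing, computes a GC fraction, and translates with the amino-acid assignments of translation table 11, which are the same as those of the standard code. It assumes an unambiguous A/C/G/T sequence given in coding orientation. Table 11 also permits alternative start codons, which are not handled here because this gene starts with ATG. All function names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order (TTT, TTC, TTA, ...).
# Translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon.

    Raises KeyError on ambiguous bases such as N (assumed absent here).
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence in this record.
first_line = "ATGCGCATTT CTTCTGTTCG GCTCTTTGGC ATCACGCTTC TTTCGTCAAT ACTCTCGGTC"
last_line = "GTTGATGTGC CCACCGCAGC GTCAATGAGC GACACCAAAC CCGCGAATTA A"

print(translate(first_line))  # MRISSVRLFGITLLSSILSV
print(translate(last_line))   # VDVPTAASMSDTKPAN
```

The two outputs match the start and end of the protein sequence listed above. The last line can be translated on its own only because the 54 preceding lines hold 60 bases each, so it begins on a codon boundary. Passing the complete gene sequence to `gc_fraction` gives a value that can be compared with the listed GC content.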