Gene Acid345_1682 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1682 
Symbol 
ID4069350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2034560 
End bp2037943 
Gene Length3384 bp 
Protein Length1127 aa 
Translation table11 
GC content57% 
IMG OID637983690 
ProductTPR repeat-containing protein 
Protein accessionYP_590757 
Protein GI94968709 
COG category[R] General function prediction only 
COG ID[COG0457] FOG: TPR repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.592663 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGAAAGC CCATCGGCAT TTTTGCTGTT CTTGCGCTTA GTTGTTCGTT TGCTATTGCT 
GCCGACCACA AACCCGATCC TGCCGAAGCG GCGCGATTGA ACAATATCGG CGTGGCGCTG
ATGAACCAGC AGCGCATGGA GAAAGCTGTC GAGAAGTTCG ATCTCGCATT GGAGAAGGAC
CCGAAGCTGT CGGTGGCATA TCTCGACAAA GGCATCGCGC TGCTGAATTT GCAGAAGCTG
CCGGAGTCCG AGGCCGCGCT GAACAAGGCG GGCGAGGCGA TGCCGAAGAA CCCGCGCGTC
TGGTACAACC TCGGGTTGCT GAATCGTGGG GCAGGGAAGT ACGACGCTGC GATCGAGAAC
TTCAATAGAG TTACGACGAT TGATCCCAAC GACTCCGACA CCTTCTACAT GATCGGCTCG
TTGTACCTGC AATTGCAGAA GTACGAGGAT GCAATCGGCG CTTATAAGAG TGCTCTGAAG
ATCAATCCGC TGCATGCGTC CGCTGAATTT GGGCTTGCGA AGGCCTTGCA GCGCGCAGGA
AAAGTAGAAG AGGCGCGCGA TCACCTGCAC ATCTTCGAGC ACCTGACCAA AGACAAGATC
TCCTCGCCGA TGACGTTGAT TTACGGCGAG CAGGGGCGTT ATTCGCTCGC GGAGGATGTG
CATACTGGTG CGCCGGAAGT GGGCGCGATG ATTCCGGTGA CGTTTGAAGC GCGGCCGTTG
CAAGGCGGGG CGCAAGCGGT TGCCGCGACG GCGGCCGACA CTGCCGGTCT GTGCATGATG
GACGTGAATG GCGACGGAAA ATTTGGCATA GTGGCGCTCG GCTCCGGTTC AAGTGCGATT
CGCGTTTTCC TGAATGACGG TAGCGGCAAG TTCAAAGAGG CGTCAGCCGC AGAGCATGGG
TTGAAAGCTG AAGGGACTGC GATCTCCTGC GCGGTTGGCG ATTTCGACAA CGACGGTCAT
CCGGATCTGG CGGTCGCGTT TACTGATCAA CTGCTGCTCT TCCGCAACCT AGGCAATGGC
AAGTTCGAGA ATGTTACGAA GGCTGCGGGA ATTTCGGCGT TGAACCATCC GGCTGGAATG
TCGTGGGTGG ATTACGACCA CGATGGCGAT CTCGATCTGT TCGTAACGGG TAGTGCAGTC
TCCGCGGGAA CCAACGTGCT CTGGCGCAAT AACGGTAACG GTACGTTTAC GGAAGTTGCT
GCCGAACGTG GCCTGCAGGG CATTGGCAGC ACAAAGTCCG TTGTGCTTAC CGATCTCAAT
AATGATCGTG CTGTGGATCT GCTCATCACT GGCGACACGG GTGCAACGGC GTATATCAAC
CCGCGCGAAG GCAAGTTCCA GACGTCCGCT CTCTATGAAG AGAAACTGCC GCCCGCGACC
GGCGCGTATG TATTCGACTT CAATAAAGAT GGGTGGATGG ATGTCGTGCT TACGCACGAC
GGCACCCCGG GAATTTCGCT CTGGAAGAAT TTGGATGGCA AGCACTTTGA GCGTGTAGCG
CTGCCAATTT CGGACGCGCA AGCTGCGTGG GGCGTGACCG CAATTGACGT GGACAACGAT
GGTTGGCTCG ACCTCGCTGC AGTCGTGCAG ACGGCGAAGG GGCCCGCGGT TCGGATCTTC
CGAAATACCG GATCAGCGGG GTTTGTCGAT GTGTCGAAGG CGATTGGTCT CGACAAACTT
CAGCTGCAGA ATCCGCGCGG AGTTGTTGCC GCCGATGTGG ACAGTGACGG TGCAGCCGAT
CTGATCGTTT CCCAAGGGAA TGCAGCACCG GTTGTGCTGC ACAACCACGG TGGGAGTGCG
AATCATTCTG TGCGAATTAC CCTCGCTGGT CTCGCCGACA ACAAGAGCGC GTTGGGAACG
AAGGTCGAAG TCTTTGCCGA CGGTCTTTGG CAAAAATGGG AGATCGTGGG CGGTTCAGGC
TACATGTCGC AAGGGCCGAA CGAGATTCTG GCTGGCATTG GCAAAAACAG CGCGGTCGAC
ATCGTGCGAA TGCTCTGGCC GGGCGGTGTG GTGCAGGACG AGACAGACAT CGCGATGGAT
AAGCCGGTCC ACTTCCTTGA GATCGATCGT CGCGGGAGTT CGTGTCCGAC GCTGTTTGCG
TGGAACGGCG AGAAGTATGA GTTTGTCTCC GACGTGATCG GTGCGGCAGT CATCGGCCAC
TGGATTTCGC CGACGGAGAA AAACCTCGCT GATCCCGACG AATGGGTGAA GGTGGAAGGT
TCGCAGTTGC GCGCGCGCAA CGGCAAGCTG AGCTTGCGCT TCGGCGAACC GATGGAAGAA
GTGAACTTCG TTGACCAGGT GCGGCTCGTG GCCGTCGATC ATCCCGCAAA TGCTGATGTT
TATCCCGACG AGCGCTTCCT GAGCGCGCCG CCGTTCGCGA GTGGCAAGGT CTTTGTGACT
GGTAGGCCAC ATCCGCCTGT GGGGGCGTGG GATGACGCGG GGAACGATGT GCTCGATCTC
GTGCGCGAGA ACGATCATCA GTACGTTCGC GACTTCCGCA ATCTTACGTA CGCTGGTTAT
GCCAAGCAGC ACGCATTAAC GCTTGATCTC GGTGAATGGA GTCCGAACGC GCCGTTGCGG
CTGTTCCTGC AAGGCTTTAT CGAGTACTTC ACCGCAAATT CGATGTACGC GGCTTGGCAG
GCGGGAATCA ACCCGGTTGC GCCTTATATT GAGGCGCAGA TGCCGGATGG TTCATGGAAG
CGAGTTGTGG ATGACATGGG TTTCCCGGCT GGATTGACGC GCATGATCAC TGTAGACCTG
ACCGGCAAGT TGCCGGCGAA CACGCGCAAG ATTCGTATCG TGACCAATCT TCAGATTTAT
TGGGACCAGG TGCTGGTGGA CAACGCGGCT CCGGCGGCGA AGACCCGCGT AACCGAATTG
CCGCTGTTGT CGTCGGACCT CCAGTTCCGC GGCTATCCAC AGCAGGTCGA CGGCGAAACT
CCGGGTGATC TGACTTACAT CTACGAAAAG GCCAGTAAGA CCGGGCCCTT CACCCGTGAG
CGCGGGAACT ACACGCATTA CGGCGACGTG ACCGAACTGC TGAAGCAAGT GGACGACCAT
TACGTGATCT TTGGCAGCGG GGAAGATATG GACCTTGAGT TCGATCCCGC CGCCTTGCCT
AAGCTGCCTG CAGGATGGAA GCGCGACTAT TTCTTCTACG CGAATGGCTT CGTGAAGGAC
ATGGACTTCT ACGAGGCGAC GCCATTCACG GTGGCAGACT TGCCATTCCA CAGGATGTCG
GCATATCCGT ATCCGGTGGG CGAGCATTAT CCGGATGATC TTGACTCGGT GCGTTACCGG
CTGGAATGGG ACGATCGTTT TGACTCTGGC ACAAACGGAG CTGGGAACCA CTTTGGCTTC
GATTATGAAA ATCGCCGTCA ATAG
 
Protein sequence
MRKPIGIFAV LALSCSFAIA ADHKPDPAEA ARLNNIGVAL MNQQRMEKAV EKFDLALEKD 
PKLSVAYLDK GIALLNLQKL PESEAALNKA GEAMPKNPRV WYNLGLLNRG AGKYDAAIEN
FNRVTTIDPN DSDTFYMIGS LYLQLQKYED AIGAYKSALK INPLHASAEF GLAKALQRAG
KVEEARDHLH IFEHLTKDKI SSPMTLIYGE QGRYSLAEDV HTGAPEVGAM IPVTFEARPL
QGGAQAVAAT AADTAGLCMM DVNGDGKFGI VALGSGSSAI RVFLNDGSGK FKEASAAEHG
LKAEGTAISC AVGDFDNDGH PDLAVAFTDQ LLLFRNLGNG KFENVTKAAG ISALNHPAGM
SWVDYDHDGD LDLFVTGSAV SAGTNVLWRN NGNGTFTEVA AERGLQGIGS TKSVVLTDLN
NDRAVDLLIT GDTGATAYIN PREGKFQTSA LYEEKLPPAT GAYVFDFNKD GWMDVVLTHD
GTPGISLWKN LDGKHFERVA LPISDAQAAW GVTAIDVDND GWLDLAAVVQ TAKGPAVRIF
RNTGSAGFVD VSKAIGLDKL QLQNPRGVVA ADVDSDGAAD LIVSQGNAAP VVLHNHGGSA
NHSVRITLAG LADNKSALGT KVEVFADGLW QKWEIVGGSG YMSQGPNEIL AGIGKNSAVD
IVRMLWPGGV VQDETDIAMD KPVHFLEIDR RGSSCPTLFA WNGEKYEFVS DVIGAAVIGH
WISPTEKNLA DPDEWVKVEG SQLRARNGKL SLRFGEPMEE VNFVDQVRLV AVDHPANADV
YPDERFLSAP PFASGKVFVT GRPHPPVGAW DDAGNDVLDL VRENDHQYVR DFRNLTYAGY
AKQHALTLDL GEWSPNAPLR LFLQGFIEYF TANSMYAAWQ AGINPVAPYI EAQMPDGSWK
RVVDDMGFPA GLTRMITVDL TGKLPANTRK IRIVTNLQIY WDQVLVDNAA PAAKTRVTEL
PLLSSDLQFR GYPQQVDGET PGDLTYIYEK ASKTGPFTRE RGNYTHYGDV TELLKQVDDH
YVIFGSGEDM DLEFDPAALP KLPAGWKRDY FFYANGFVKD MDFYEATPFT VADLPFHRMS
AYPYPVGEHY PDDLDSVRYR LEWDDRFDSG TNGAGNHFGF DYENRRQ