Gene Acid345_1549 details


Gene Information

Locus tag: Acid345_1549
Symbol:
ID: 4072940
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1894436
End bp: 1897633
Gene Length: 3198 bp
Protein Length: 1065 aa
Translation table: 11
GC content: 61%
IMG OID: 637983558
Product: TonB-dependent receptor
Protein accession: YP_590625
Protein GI: 94968577
COG category:
COG ID:
TIGRFAM ID:
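
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1894436
END_BP = 1897633
GENE_LENGTH_BP = 3198
PROTEIN_LENGTH_AA = 1065


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming one stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both the reported gene length and the reported protein length agree with the start and end coordinates.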


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.114571
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCCCTT TGCGAACGCT CTGCGTCGTT CTCCTGCTCT CTTTTTCTGC GTTCGCACAG 
GTCTCTGCGA ATCTCTCGGG CTTTGTCACC GATCCCAGCG GCGCCGCCGT GGCCGGCGCA
GAAGTACGCG CCATCAACAA TGAGACCGGT GCGGTTCGCG TCTCCGCGAC CAATGCCTCA
GGACGCTACG ATATTGTCGC GTTGCCCGTT GGACAGTATG AAGTCCACGC GAGGAAGCAG
GGCTTCGCCG AACAAGTGCG CACCGGCATC CTTCTCGTAG TCGGGCAGGA TGCTACCGCC
GACCTCAAAC TCCGCATCGG CGACGTAAGC GAGCAGGTGA AGGTGAGCGC CGACGCCGAA
ATGGTCGGCG TTACCAACCA GGACATCTCT GGCCTGATCG GCGAAAAACA GATCAAAGCG
CTTCCCCTGA ACGGCCGCAG CTACGATTTG CTCGTTACGC TCAATCCCGG CATCGTTAAC
TTCACGTGGG AGAAAACGGG CGGCACCGGC GTTTCCAATT CCACGACCGG CAACAACTTT
TCAGTCTCGG GGAATCGTCC GCAGCAAAAT CTGTTCCTGA TGAATGGCGT CGAGTTTTCG
GGCGCGGCGG AAAACAACAT GCAGCCCGGC GGCCCGAGCG GCAACGTGAT CGGTGTGGAA
GCCGTCCGCG AATTCAACGT TCTTCGCGAC AACTACGGCG CGCAATACGG CAAGCGTCCC
GGCGGACAGG TCGTCATCGT CACGCAGTCC GGCTCAAATC AGTGGCACGG CTCCGCCTAT
GAATACCTGC GCAATAACGT CTTCGACGCC CCCAACTACT TTGATCAGGG CGACGCTCCA
CCGTTCCAGC GCAATCAGTT CGGCGCGTCG TTAGGCGGTC CGCTCCGCCA TGACCACACA
TTTTTCTTCC TGAACTTTGA AGGCGTAATC CAGAACTTGC ATCAGACCTC CGCCGCGTTC
GTACCCGGCC TCGACTCCCG CGCCGCCGCC GTTCCCAGCG TGCAGCCGTT GCTCAATCTC
TGGCCTACGC CATCGGCGAG CGCTCCGGAA TTCAACGGCA TCGCGCAGGT CTTCAGTAGT
CCTCTTCAAA CCATCCGCGA ATATTTCGGC AACGCGCGCG TAGACCACAC CTTCTCTTCG
CGTGATTCGC TCGCAGCGTC GTACGTCATC GACGACGGCC ACGATCTCAC CGCCACGGTT
GCCAATCCTT TTAGCTCCGA CATCCTCACG CTACGCGAGC AAGTGCTCAG CCTCAACGAG
ACGCACGTCT TTTCGCCGTC GCTCTTGAAC GTCGCGCGTG TCGGCTTCAC GCGCGCCGGA
TACTACTTCG CCGGTGAACC CACACCCGGA ACCCCCGCCG CCGATGTCCC CGGCTTCCTC
CTCGGACATC AGGTCGGCGC GGTTGTCGTC GGCGGCAGCG CCGCATCCAA TCCGCAGGCG
CAGGTCGGAT TAGCCGGCAG CAACAACGGC AGCAATCTCA GCATTGCGCG CAACATCTTC
ACGTACGAAG ACCAGCTCAG CCTCACCCGT GGCCGCCACC AGATCACCGT CGGCGCATGG
TTCCAGCAGT TCCAGTCGAA CGAGAACATC GCGCTCAGCC AGTACGGTCA GGCCACCTTC
GCGAGCCTGA CCACGTTCCT GCAAGGCACC ATCGGCACCT TCTTGTACGA TCCCGCGCCG
AGCAACATGA ACTGGCGTTC GCTCTTCGGC GCCGGGTACG CCGAAGATGT CTTCCGCGTC
TCGCCGAAGT TCACGCTCAC CCTCGGCTTC CGCGGCGAGT TCTCCACCGG ATGGAACGAA
GCTCACGGTC GCGCCGCGAA CTACACTTAC AACAACGGCA TCATCTCCGA TCAGCCGCGC
ATTGGCGATA GCGCCTTCAC CGAAAACAAT TACAAGTTCC TGGCACAACC GCGCGTCGGC
GTCGCGTACA GCCCGTTTAG CGGCACGGTG TTCCGCGCCG CGTTCGGCAT CTACAACGAA
CTCCAGGACG CCCTCGGCTA CCGCATGGAC CAGAACGCTC CGTTCAACCC GGTGTACAGC
CTCGCGAACT ATAAAGTCTC GAACCTGCCG CTCGATCCCA CCGCCCCTGT GCCTGCCGCA
GCGTTGCTAA TTCCCGGTGG CACCCAGCCC GATCTCAAGG CTCCCACGCT GTTCTCCTGG
ACCTTCGGCA TCGAGCAGGA ACTCTCGCAC AATACGTCGT TGAACCTGCG CTACGTGGGC
TCGCACGGAT ACCACGAACT CGTCGGCGTT GACGCCAACG TGCCCTCGAA TCCGATCATC
TGCCCCGCCG ATCCATGCCC CGCGGTCTAT CCGGCGACTT TCCCGGTAGG CCTCGCGGGA
ACGCCGATTC CCGCCGGAAG CTACTACATC CCGAAAGGCA CGCCGAAAGG GAATCCCACG
ATCAACAACA CGTGGACGTG GTTCTCTGTC GGTACGAGCA GCTACAACGC GCTGCAACTC
GACGTGAACC ATCGCTACAG CAACGGCCTC TCTCTGCGCG GCGTGTACAC ATGGTCGAAG
ACGCTCGACG ACGGTGACTC GCTCAACCAG ACCACCGCCA ACAACGCACC GGGGCTGGTG
TCGAATCCGT ATGACATCAA GGCCGACTGG GGACCTGCGA CCTATGACGT GCGCAATCTC
GGCGTAATCA GCGCCGTTTA CGAATTGCCA TTCGGACGCG GCAAGCGCTT CCTCGCGTCA
TCGAATTCGT TTGCCAACTG GACGGTCAGC GGATGGTCTG TGAACAGCAT CGCCGTAATG
CAGAGCGGCT TCCCATTCAC GCCGCAACTC AGCTACAACC CATCGAACAC CGGCGACACG
CGCAATCCCG TGCGGCCCTT TGCGAACCCG AACTTCACCG GACAGATCGT CACCGGCAAT
CCAATCCAGT GGTTCAATCC CGCCGCATTT CTGGCACCTC CGGCGAACAG CGGCTTCTGG
GGAAATCTCG GACGCAACAC GCTTACCGGG CCGGGGCTCG GTACGTGGGA CTTCTCCGCG
ATCAAAGATT CGAAAATCAA CGAGCGGATG AGCCTGCAAT TTCGCGCCGA AATCTTCAAC
CTGCTCAACC GCGCGAATTT CAATACGCCG AACTTGATCG TCTTTACGCC GACGGGCGTA
TCCGGAACCG CAGGCGCAAT CAGCAGCACG TCCACCACCT CGCGCCAGGT ACAGTTCGCC
CTCAAACTGC TTTTCTAG
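
The GC content field can be recomputed from the gene sequence as the fraction of G and C bases. This is a minimal sketch; `gc_content` is a hypothetical helper, and it assumes the sequence is pasted as shown here, in whitespace-separated blocks of ten bases.

```python
def gc_content(dna: str) -> float:
    """Fraction of G/C bases in a DNA string; whitespace is ignored."""
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


if __name__ == "__main__":
    # Toy input; paste the full gene sequence to compare with the record.
    print(f"{gc_content('GGCC AATT'):.0%}")
```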
 
Protein sequence
MSPLRTLCVV LLLSFSAFAQ VSANLSGFVT DPSGAAVAGA EVRAINNETG AVRVSATNAS 
GRYDIVALPV GQYEVHARKQ GFAEQVRTGI LLVVGQDATA DLKLRIGDVS EQVKVSADAE
MVGVTNQDIS GLIGEKQIKA LPLNGRSYDL LVTLNPGIVN FTWEKTGGTG VSNSTTGNNF
SVSGNRPQQN LFLMNGVEFS GAAENNMQPG GPSGNVIGVE AVREFNVLRD NYGAQYGKRP
GGQVVIVTQS GSNQWHGSAY EYLRNNVFDA PNYFDQGDAP PFQRNQFGAS LGGPLRHDHT
FFFLNFEGVI QNLHQTSAAF VPGLDSRAAA VPSVQPLLNL WPTPSASAPE FNGIAQVFSS
PLQTIREYFG NARVDHTFSS RDSLAASYVI DDGHDLTATV ANPFSSDILT LREQVLSLNE
THVFSPSLLN VARVGFTRAG YYFAGEPTPG TPAADVPGFL LGHQVGAVVV GGSAASNPQA
QVGLAGSNNG SNLSIARNIF TYEDQLSLTR GRHQITVGAW FQQFQSNENI ALSQYGQATF
ASLTTFLQGT IGTFLYDPAP SNMNWRSLFG AGYAEDVFRV SPKFTLTLGF RGEFSTGWNE
AHGRAANYTY NNGIISDQPR IGDSAFTENN YKFLAQPRVG VAYSPFSGTV FRAAFGIYNE
LQDALGYRMD QNAPFNPVYS LANYKVSNLP LDPTAPVPAA ALLIPGGTQP DLKAPTLFSW
TFGIEQELSH NTSLNLRYVG SHGYHELVGV DANVPSNPII CPADPCPAVY PATFPVGLAG
TPIPAGSYYI PKGTPKGNPT INNTWTWFSV GTSSYNALQL DVNHRYSNGL SLRGVYTWSK
TLDDGDSLNQ TTANNAPGLV SNPYDIKADW GPATYDVRNL GVISAVYELP FGRGKRFLAS
SNSFANWTVS GWSVNSIAVM QSGFPFTPQL SYNPSNTGDT RNPVRPFANP NFTGQIVTGN
PIQWFNPAAF LAPPANSGFW GNLGRNTLTG PGLGTWDFSA IKDSKINERM SLQFRAEIFN
LLNRANFNTP NLIVFTPTGV SGTAGAISST STTSRQVQFA LKLLF
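
The relationship between the two sequences can be sketched as a codon-by-codon translation under translation table 11. The code below is an illustration, not the database's own procedure: `translate_cds` is a hypothetical helper, the amino acid assignments are those of the standard code (which table 11 shares), and the start codon set is the one listed for table 11 in the NCBI genetic code tables. A start codon in the first position is read as M, which is why the gene begins with GTG while the protein begins with M.

```python
from itertools import product

# Codon table in TCAG order; the third base varies fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons for translation table 11 (bacterial, archaeal, plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiation codon at position 0 is translated as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_line = (
        "GTGTCCCCTT TGCGAACGCT CTGCGTCGTT "
        "CTCCTGCTCT CTTTTTCTGC GTTCGCACAG"
    )
    print(translate_cds(first_line))
```

Applied to the first line of the gene sequence, this yields MSPLRTLCVVLLLSFSAFAQ, the first 20 residues listed under Protein sequence.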