Gene Information |
Locus tag | Acid345_3909 |
Symbol | |
ID | 4072246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4624580 |
End bp | 4628041 |
Gene Length | 3462 bp |
Protein Length | 1153 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637985935 |
Product | TonB-dependent receptor |
Protein accession | YP_592983 |
Protein GI | 94970935 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
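The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. Both assumptions match the values in this record but are not stated on the page, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(4624580, 4628041)
print(length, protein_length(length))  # 3462 1153
```

The result agrees with the Gene Length (3462 bp) and Protein Length (1153 aa) rows.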
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000652763 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00581227 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGTCTT TCCGCCACGT AGCGATAAGC GTTTTCCTCT TTGTTGTTCT CACAGGCATC GGCCTGGCGC AGAACACCAG CCAGATCACC GGTTCGGTCC GCGACTCAAG TGGCGCGGCC GTACCCAATG CCGAAGTGGT CGTGAGCAGT CCGGAGCGCG GTATCGAGCG CCCTACAAAG ACCAACGACG CCGGTGAATA TGCAGTCAGC GGCATTCCCG CCGGTTCCTA CAACTTGAAA GTTACGGCGC AGGGCTTCAA GTCCTACGAA GCGAAAGGCA TCGTGCTTCG CGTGGCGCAG AAGACACGCG CTGACGCCGA CCTTCAGATC GGCGGCACGA CGACCGAAGT CACCGTAGCC GGCGAGAGCA TCGGCCAGGT GGAAACGCAA TCGTCGGATA TGTCGGGCGT CGTCACCGGC AAAGAGATTT CGCAACTTCA GTTGAATGGA CGCAACTTCA CGCAGCTCGT GACCCTCGTC CCCGGCGTCA GCAATCAGAC CGGACAGGAT GAAGGCACCG TGGGCATCGC TGGGAACGTC TCTTTCAGCT TCAATGGCGG CCGCACTGAG TACAACAATT GGGAGCTCGA CGGTGGCGAC AATATGGACA ACGGGTCCAA CGCCACCCTT AACGTTAACC CAAGCTTGGA CTCCATCGCC GAAGTCAAAG TGCTCACCTC GAACTACGGC GCGCAGTATG GCCGCAGCGG TTCTGGCACG GTCGAAGTCG AGACCAAGTC TGGTACCAGC AGCTTCCACG GCGACGCATA CGAATTCGTT CGCAACGACG CGTTCAACGC GAAGAGCTAT TTCTCTTACA CCGAGCCGCT GATCCCGGCT TATAAGAAAA ACGACTACGG CTACACTCTC GGCGGACCGA TCTTCATTCC CGGGCACTAC AACGAGAGCA AGCAGAAGTC TTTCTTCTTC TGGTCGCAGG AGTGGCGCAA AGAACGCGTT CCGGCGCCGT TCAACATTCC GGTGCCGTCA GCCGCGGAGC GCGCCGGAGA CTTCAGTGAC CAGTGCCCCG GCAACTCCTG TCCGCATATG GCTGACGGGA GCCCGTATCC CGGCAACATC GTTCCGATTG ATCCGACGGG AAGTGCGCTT CTGGCGCTGA TACCGGGGGC GAATCTCGGC TCAGGAGCGA GTTCGGTGTA CAACGCTTCG CCGACACAGC CGACCTACTG GCGCGAAGAA CTCTTTCGCA TCGATCACAA TATCAACGAC AAGTGGCACG TGACGTTCCG CTACACCCAC GATAGCTGGA ACACAATCAA CCCAACGTCA CAATGGACCG GTAGCGCTTT CCCAACGGTG CAGACGAATT TCGTCGGCCC GGCAATCAGC ATGGTGGGGC GTGTCACAAC GACCTTCACG CCGACGCTGG TCAACGAGTT CGTGATGAGC TACACCACCG ACCACATCAC GTTCTCTTCG ACCGGAACCC CGAATCCGAA TGCCTGGCAG CGACCGCAGG ATCTCGCCAT GGGCTATCTC TTTAACAACG GTTTTGGTGG AAAGCTTCCG GCGATCACCG TCTCCGATCC TGCTTACGGC GGAGGCTTCT ACGAGGACCC GAACGGCGAA TGGCCGGAAG GCGCGTACAA CTCGAACCCG ACTTACACCT TCCGCGACAA CTTGAACAAG ATCATCGGAA GACATAACCT GCAGTTCGGT GCGTACTACG TTGCGGCACA GAAGAACGAA CTCAGCGGCA TCCTGGTCAA TGGATCGCTC GGCTTCGACA GCACGTCGGC GGTTTCAACC GGGAATGCCT TTGCAGACAT GCTGACCGGA 
AATATCGCGA GCTTCTCGCA GGGCAGCGAC AACATCAAGT TCTACAACCG CTACAAGATC CTCGAACCCT ACTTCCAGGA CGACTGGCGC GTCACGCCGA AACTCACGTT GAACCTCGGA ATCCGTCTCA GCGCGTTCGG GACTTACCGC GAGAAGGACA ATCACGCCTA TAACTGGGAC CCGAAAGCCT ACGATCCAAC CTCCGCTCCG GTGTTCAATG CTGATGGTTC AGTGAGCGGC GGCAACATTT ACGACGGGCT TGTGCAGTGC GGCAAGAGCA GCGTGCCGGA AGGCTGTATG TCCGGCCATC TGTGGAACTG GGCTCCGCGA GTGGGCTTCG CTTGGGATCC GTTCGGCACC GGCAAAACTG CTGTTCGCGG CGGCTACGGG ATCTTCTACG AGCACACCAA CGGCAACGAA GCCAATACCG AAGGCTTGGA AGGGCAGTCG TCTCCGCTGA TCCAGACCGC TTCGCAGTCG AGTGTGGTTG GGTACACCAA TCTTGGCGTC GCCGCCGGGC TTGACGCGCA GTTCCCGCTG AGCTTCATCT CCGTTCCCAC GAGCGCCACA TGGCCGTACA TGCAGCAATG GCACTTTGAT ATCCAGCACG AAATCATGAA GGACACCGTG CTGGTTGTGG CCTACGTCGG CAGCAAGGGC ACCCACCTCG GCCGGCAGTC GGACATCAAC CAACTTCTCC CGACGCCGCT CGCCGACAAT CCATTTAAGG CAGGCGAGGT CATCACCTCG GATGTCTGCA ACAACATGAT GACGCCTAGC GGCGTTGCCG TGACCGGGCA GGCAGCAACC AATCTCGCGG TCGCTTGCGG CGCTGATGCC AACCCATTTC GTCCGTACCT GGGCATCGGC ACCATCACCC GCTTGGAGAA CGAGTCGGGC TCCACGTATC ACGCCTTCCA ACTCGCAGCA CGTCGCAACG TTGGACAGTT ACAGTTGAAC GTCGCTTACA CCTGGAGCCA CTCCATTGAC GACGCTTCCG ACCGCTATGA CGGGTCGTTC GTCGATGCCT ATGATCCGCG CCTGAATCGC GCCAGTTCGA GCTTCGATAT TCGGCACATG CTTAACGTAG GCTACGTTTG GGACATGCCG TTCTTTAAGG ACCGTGGCTG GAAGAATATC CTGCTCGGTG GCTGGGAACT GTCTGGCATT ACCAGCTTCC AAACCGGCAC ACCGTTTAGC GTGCCGAACG GCGGCGCTTA CGGTGACAAC GCTGGGGTCG GCAATGGCGT CGGTACCGGT TCGTATGCGG ATGTCGTCTC GGATCCGTAC TCGAATATCC CCGGCGGAAA CGGCGCATTC CTTGGGCCGC TCGTCGGGAA CCCGGCGGCG TTCGCACAGC CGACAGCACT TACGTTCGGA AACTCGGGAC GCAATTACCT GCGTAACCCG GGCTACACCA ACTGGAACAT GTCGCTCTTC AAGAACTTCA AGCTCAGCGA GCGCTTCAAT CTCCAGTTCC GAAGCGAAGC CTTCAACATC TTCAACCACA CCGAGTGGGC TTCGGTTGGC GGCGACGCCG GCTCCGCTGC CGGCAACGGC CTGCAGTCCT ACACCAACTC CTTCGGAGGA GACAATTTCC TGTACATCGG AGCTGCCCAT CCGCCGCGCA TTCTGCAACT CGGTTTGAAA CTTGTCTTCT AG
|
Protein sequence | MKSFRHVAIS VFLFVVLTGI GLAQNTSQIT GSVRDSSGAA VPNAEVVVSS PERGIERPTK TNDAGEYAVS GIPAGSYNLK VTAQGFKSYE AKGIVLRVAQ KTRADADLQI GGTTTEVTVA GESIGQVETQ SSDMSGVVTG KEISQLQLNG RNFTQLVTLV PGVSNQTGQD EGTVGIAGNV SFSFNGGRTE YNNWELDGGD NMDNGSNATL NVNPSLDSIA EVKVLTSNYG AQYGRSGSGT VEVETKSGTS SFHGDAYEFV RNDAFNAKSY FSYTEPLIPA YKKNDYGYTL GGPIFIPGHY NESKQKSFFF WSQEWRKERV PAPFNIPVPS AAERAGDFSD QCPGNSCPHM ADGSPYPGNI VPIDPTGSAL LALIPGANLG SGASSVYNAS PTQPTYWREE LFRIDHNIND KWHVTFRYTH DSWNTINPTS QWTGSAFPTV QTNFVGPAIS MVGRVTTTFT PTLVNEFVMS YTTDHITFSS TGTPNPNAWQ RPQDLAMGYL FNNGFGGKLP AITVSDPAYG GGFYEDPNGE WPEGAYNSNP TYTFRDNLNK IIGRHNLQFG AYYVAAQKNE LSGILVNGSL GFDSTSAVST GNAFADMLTG NIASFSQGSD NIKFYNRYKI LEPYFQDDWR VTPKLTLNLG IRLSAFGTYR EKDNHAYNWD PKAYDPTSAP VFNADGSVSG GNIYDGLVQC GKSSVPEGCM SGHLWNWAPR VGFAWDPFGT GKTAVRGGYG IFYEHTNGNE ANTEGLEGQS SPLIQTASQS SVVGYTNLGV AAGLDAQFPL SFISVPTSAT WPYMQQWHFD IQHEIMKDTV LVVAYVGSKG THLGRQSDIN QLLPTPLADN PFKAGEVITS DVCNNMMTPS GVAVTGQAAT NLAVACGADA NPFRPYLGIG TITRLENESG STYHAFQLAA RRNVGQLQLN VAYTWSHSID DASDRYDGSF VDAYDPRLNR ASSSFDIRHM LNVGYVWDMP FFKDRGWKNI LLGGWELSGI TSFQTGTPFS VPNGGAYGDN AGVGNGVGTG SYADVVSDPY SNIPGGNGAF LGPLVGNPAA FAQPTALTFG NSGRNYLRNP GYTNWNMSLF KNFKLSERFN LQFRSEAFNI FNHTEWASVG GDAGSAAGNG LQSYTNSFGG DNFLYIGAAH PPRILQLGLK LVF
|
| |
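The sequence fields can be related to the GC content and translation table rows with a short script. The sketch below assumes the sequences are pasted exactly as shown here, in space-separated blocks of ten characters, and it relies on the fact that NCBI translation table 11 assigns codons to amino acids the same way as the standard code (the two tables differ only in permitted start codons). The function names are hypothetical, and only the first 30 bases of the gene are used to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
prefix = "ATGAAGTCTT TCCGCCACGT AGCGATAAGC"
print(gc_content(prefix), translate(prefix))  # 0.5 MKSFRHVAIS
```

The translated prefix matches the first block of the protein sequence, MKSFRHVAIS. Running `gc_content` on the full gene sequence gives the figure to compare against the GC content row.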