Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4190 |
Symbol | |
ID | 4072149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4959572 |
End bp | 4962916 |
Gene Length | 3345 bp |
Protein Length | 1114 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637986221 |
Product | TonB-dependent receptor |
Protein accession | YP_593264 |
Protein GI | 94971216 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.351423 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAACCAC AATTTGGTAA GCGAAGAGCG GCATTCGTTA GTGTTCTGCT ATTCGCACTT ATCTTTGCTG TTTCTGCGTG GGGCCAAACG AGTAAAGGAA TCATCGCTGG AACAGTCACA GATACGAGCG GAGCCGTTGT CGCCGGAGCA CAGGTAACAG CGTCAAACAC GGATACCGGC GAAACACGCA CCGTTCAGTC GGGACCGACG GGTGCGTACC GCGTTGAGGC AGTCACACTC GGCAAATATC GCATCAACGT ATCGTTCCAG GGTTTTCAGA GCCAGGTTAT AAATGGCGTT GAGGTTGCCG GATCGGTGAT GACTGCGGTT GATATCAAAC TGCAAGTCGC CAGCGCCGCC AGTACAGAAG TGACGGTATC GGCGGACAAC AACCAGGTGC AGACCGAAAA TGGCGAGATC TCCAGCACGA TCACTACGAA AGAATTGACC AATCTACCGC TCGCCTCCCT GAACCCGATC GAACTAGCAC TGACGCAGCC CGGCGTTATA GACAATGCCG GCCGTGGAAG CACGAACGGC ATGGGTTTCT CGGTGAACGG TTCACGTCCA CAGAGCAACA ACTTCCTGAT TGATGGTCAG GACAATAACG ACAACAGCAT TCAGGGCCAG GCGTATCAGC CACAGAATCC GAACGCAGTG CAGGAAGTCG CCATCATGAC GAACTCCTAC TCCGCGGAAT TCGGTCGTGG CGGAGCTTCG GTCACCAACG TTATTTACAA GAGCGGCTCC AATCAGTTCC ACGGAACGCT GAGCGAACTC TACTCGGGCT CCGGATTGAA CGCGATCGAC GCCGCCAACG GCCTTACTGG AATGAAGGAC GGCGGCGATT GCAACCGAGC GCCGAATTTC TTCCCATGTA AGACGCGTTT CGACTCCCAC ACCTTCGGAT TTACGTTCGG CGGTCCGCTC ATTAAGGACA AGCTGTTCTT CTTCGGCAGC GGAAACTGGG AGCGCACCTA CGGTCAGGAG CAGAACAGCA ACATTCTGAT CCCCACTGAA ACCGGCATCG GCCAGTTGCA GGCGTATGGA TCGGCGAACG CGACTCTTCT GACCCAATAC CTCGGATCTC TCCGCGGATC TACGGTTGGA GCGACGTGCG TCAACACTGG CATTTCGTCG TTGCCTTGCG TAGAGATGGG CAATTACACG CCGACCGCGC CGCAGCAAAA CACTGACACG CAGTGGAACA TCAAGGCGGA CTATCTGCCG CGGCAGACCG ACACCATTAC CTTCAACTAT CTCCACGACC GCGGCTACTT CGCGCCTGAC TGGTTCGCGA ATCCGGGTTC CGTATTGCCA GGCTTCGAAA CCTTCCAGGG CGGACCGTCG TGGATCGCGG GCGCTTCGTG GACGCACACC TTTAGCTCGA ACAAGGTGAA CGAATTCCGC GCGTCCTACG GACACCTCGG CTTCACGTTC GGACCTACGG CCGCAACTAC AGGGAATCCG CTTTATCTAA TGCCGAGCCT GGCACTGAGC AGCGCCGGAT TGGATGCATT CCCGCTATTG GGTACTGACT CAGCTTTCCC GCAGGGTCGT AGCCACCGCA CCATGCAGTT GCAGGATGGC TTCACCATTA CGAAGGGCAG CCACACCTTC AAGATGGGCG CGGACGTCGC GCGCATCTGG GTGACGGACC AGATTCCAAT CAACACCCGC GGAACTCTGA CCTTCACCGA TGGTGGCGGC TACACCGCAC TCGGCAACTT CCTCGATAAT TTCACCGGCA CCAGCGGCCA GGCGTTGGAT CTCCAGACAG GTAATCCTAC GGTGAAGCCC ACGTTGCTGC AGAGCGGCTA CTACTTCCAG GACAACTGGA AGGTGCGGCC GAACCTGACG TTGAACCTCG GCGTCCGTTA CGAGTATCAA ACGAATCCCG AGAACTCACT CCCGTACCCG GCGGTGTCGA ACATCTACGG CGGCGATAAC AATTTCCCGA CGGTGGTGAA GGCGGACCAG CAGTTCTCGC ACATCGCTCC TCGCATCGGG TTTGCCTACT CGCCAAACTT CCTGCCAAGC ATCTTCGGAA ACGGCAAGAC CGTTCTCCGC GGCGGCTTCG GCATTTTCTA TGATGCGATC TACACCAACA TTCTCGATAA TACGGCCTCG TCCGCACCGA ACTCGATCGA CGAGCCGCTG TTTGGCTCGG ACTCTTCGAA CGCGCGCGGT TTCGCCAACG CAACCGGACA GTTCAGCGGA CTCTCAGGTA CCTTGAGCCC GTTCAATACC GTTACAAGCA TTGCGAAGGA CCTCACCAAT CCGCGTACCA CGCAGTGGAA CTTCAACATT GAGCGTGAAC TGCCGGCAGA CATGCTCCTG ACGGTCGCCT ACCTGGGATC GCGTGGACAG AAACTGTTGG TGAATGACGA CTACAACCCC TTCGGCGGCT ACGATGCCAG CGGCAACTAT ATTCCGCGCT TCAATTCGAA CCGCGGTGCC ATGGCAATTC GCACCAATGG CGGTGATTCC TACTACAACG GCCTGGCAGT TACGGTCGAG CGCAGGTTCT CTCATGGCTT GATGCTGCGC AGCGCATATA CGTTCTCGAA GTCCATTGAT GACAGCTCGA ATATTTTCGT CATCACGGGC GGATCGTCGT ATGCGCAGGA CCCAACCAAC CGCCAGGCGG ATCGTGGTTT GTCAGCCTTC AACGCGTTCC ATCGTTGGGC GTTTACCTAC GTGTGGGACG TACCGGGCTT CAAGGCGCAG GACAACAAAG TGCTGAACGG CCTCGCGTAC CTCTCCCGTC ATTGGGAATG GACCGGGACC ACGACCCTGC AGTCTGGCTT GCCTGACACG ATCTACGATA GCTTCGATAG CAGCGGTCGC GGCCACAGTT CGAGTGGACG TCCGGACCTG TTGAACGCTT CGGCGCCCAT GAATGCGATT GCTTTGCCGG CAGCGTGGGG AGCTTGCGCA CCGGGCGCGG ATTACTGCGA CCTGGCGACG CTGAACTCGC TCTCTGCCAG CGACCTGAGC AACTATCATT TCCTGGTTCC GTTTGGCGCG CCGGGTACGG TGGGACGCAA CAACTACATC CTGCCTGGCC AGGTGAACTT TAACTTCGGA ATCAACCGCA ACATCCCGAT TCCGCGTCAC GAATCGCAAG TGGTGACGCT GCGCGTGGAA ATGTATAACC CGTTCAATCA CCCGAACCAG TCGGCACTTC CGAACCAGGG CATGTGGAGC ACGAACGTCT CTGACATGTT CCTGGTTGGA CAGGACTCGA GTGGGAACCT CAACAACCCG AGCCACCTGT TCGACACCTA CTGGGCACGC CAGGGTGCGC GCAATATCAA GCTGTTGATC AAGTACCAGT TCTAG
|
Protein sequence | MQPQFGKRRA AFVSVLLFAL IFAVSAWGQT SKGIIAGTVT DTSGAVVAGA QVTASNTDTG ETRTVQSGPT GAYRVEAVTL GKYRINVSFQ GFQSQVINGV EVAGSVMTAV DIKLQVASAA STEVTVSADN NQVQTENGEI SSTITTKELT NLPLASLNPI ELALTQPGVI DNAGRGSTNG MGFSVNGSRP QSNNFLIDGQ DNNDNSIQGQ AYQPQNPNAV QEVAIMTNSY SAEFGRGGAS VTNVIYKSGS NQFHGTLSEL YSGSGLNAID AANGLTGMKD GGDCNRAPNF FPCKTRFDSH TFGFTFGGPL IKDKLFFFGS GNWERTYGQE QNSNILIPTE TGIGQLQAYG SANATLLTQY LGSLRGSTVG ATCVNTGISS LPCVEMGNYT PTAPQQNTDT QWNIKADYLP RQTDTITFNY LHDRGYFAPD WFANPGSVLP GFETFQGGPS WIAGASWTHT FSSNKVNEFR ASYGHLGFTF GPTAATTGNP LYLMPSLALS SAGLDAFPLL GTDSAFPQGR SHRTMQLQDG FTITKGSHTF KMGADVARIW VTDQIPINTR GTLTFTDGGG YTALGNFLDN FTGTSGQALD LQTGNPTVKP TLLQSGYYFQ DNWKVRPNLT LNLGVRYEYQ TNPENSLPYP AVSNIYGGDN NFPTVVKADQ QFSHIAPRIG FAYSPNFLPS IFGNGKTVLR GGFGIFYDAI YTNILDNTAS SAPNSIDEPL FGSDSSNARG FANATGQFSG LSGTLSPFNT VTSIAKDLTN PRTTQWNFNI ERELPADMLL TVAYLGSRGQ KLLVNDDYNP FGGYDASGNY IPRFNSNRGA MAIRTNGGDS YYNGLAVTVE RRFSHGLMLR SAYTFSKSID DSSNIFVITG GSSYAQDPTN RQADRGLSAF NAFHRWAFTY VWDVPGFKAQ DNKVLNGLAY LSRHWEWTGT TTLQSGLPDT IYDSFDSSGR GHSSSGRPDL LNASAPMNAI ALPAAWGACA PGADYCDLAT LNSLSASDLS NYHFLVPFGA PGTVGRNNYI LPGQVNFNFG INRNIPIPRH ESQVVTLRVE MYNPFNHPNQ SALPNQGMWS TNVSDMFLVG QDSSGNLNNP SHLFDTYWAR QGARNIKLLI KYQF
|
| |