Gene Information |
Locus tag | Acid345_2136 |
Symbol | |
ID | 4072378 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2552116 |
End bp | 2555490 |
Gene Length | 3375 bp |
Protein Length | 1124 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637984151 |
Product | TonB-dependent receptor |
Protein accession | YP_591211 |
Protein GI | 94969163 |
COG category | |
COG ID | |
TIGRFAM ID | |
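The coordinates, gene length, and protein length listed above appear mutually consistent, and this can be checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2552116
end_bp = 2555490
gene_length_bp = 3375
protein_length_aa = 1124

# Assumption: coordinates are 1-based and inclusive, so the span
# is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```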
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0917219 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGCCATT GGCGGCTTCG CCTGGGCGCG TTGTTACTTG GTTGCTTGTG TGCGGGAAGT CTGTTCGCCC AGGAGATTAC GGGCGACATC CGCGGAATTG TGAAAGATGC TTCGGGCGCA CTCGTCGCCG GAGCCACCGT TGAGGTAACC AACACTGATC GCAACACGAC AATTCGTACC GTCACGACCG ATACCAACGG CAATTACGTT GCCGCTTACC TACCCGTCGG TCATTACAAG GTCTCGGTCA AGAAGGAAGG CTTCAAGGCA GCTGAGACCA ACAATGTGGT CTTGAACGTG CATGATCGCC TTACGGTGGA TGAAACGCTC CAGGTCGGCT CTAGCGGCCA GACCGTAACG GTCAACGAGA ATCCCAGCCA GGTGAATCTC GACAACGCGA CTGCCCAAGG CGTGATCACC GGCAACCAGG TGCGGCAGCT CACACTCGTC ACGCGTAATT ACGAGCAGTT GGTTGCAGCC CTTCCCGGCG TTTCGACGAA CCTCGCTTCC GATCAGCTGT TCGTCGGCGT AAGCAATCCG GTCGGCACCT CAAACCAGAT CAACTTTTCG ATTAATGGCA CCCGTCCAAC GCAGAACAAC TGGCAGATCG ACGGTTCCGA CAACGTGGAC CGCGGCGCCA ACCTGACGCT GCTCGCCTAT CCGAGCGTGG ATTCGATCCA GGAGTTCAAC GTCCTGCGCT CGAACTACAT GCCGGAACAA GGACGCAGCT CAGGCGGACA GGTCAACGTC ATCACGCGTT CCGGCACCAG CGCCTTCCAC GGCAGCGCGT ACGAGTTCTT CCGAAACGAT GTGCTGAACG CCAACAACTT CTTCAATAAT CGTGCCGACG TTGAACGCCC CGCGATGCGT TGGAACGACT TTGGCTTCAC CATCGGCGGA CCGATCTACA TTCCCGGCCA CTACAACACG GAAAAGAACA AGACGTTCTT CTTCTATTCG CAAGAGTGGC GAAAGATCAT CACCTACAAC ACGTTCACCA GCGGCGTGCT GCCCACGTCG GCAAACCTCG GAGGCGATTT CGGAAGCACG ATTTGCGTCG CTTTGAATCC CGATGGGACG TGCGCAGCGT TGGGCAATCA TGTCTCCACG ATTAGTCCCA CGGCGCAGGC ATACATCAAC GACATCTATT CAAAGTTCCC AGCGCCCAAC AATGCTGACG GAACGCTCAC CTGGGTAGGA CGCAACCAGT TCAACTATCG CGAAGAGAAC GTTCGCGTTG ACCACAATTT CTCGTCCAAG TTCAGCATCT TCGGACGCTA CCTCGACGAC CAGATCCCAA CGCAGGAGCC TGGCGGTCTG TTTACCGGTC TCGCCGTTCC TGGCGTTGCT GTGACCAACA CGAATGCTCC CGGACGCAAC GCCTCGATTC ACGCGACGAT TGCGTTCTCG CCCACGACGC TGGCGGACAT GGGCTATGCG TACTCGTATG GCGCGGTCAT CAGTTCGCCG GCGGGAACCA TGGCCTCAGC GAATTCGCCG GATGTGAATC CGACGCTTCC CTTCGGACTC GGTCCGCTGC TTCCGGGCAT CGGATTCTTC AATTCCACGC AGGGGCTCGC CGGATTCGGT CCATACAACG ACTACAACTA CAACCACAAC GCGTTCGCTA CGTTGACGAA GGTGATTGGA AAGCACTCGC TGAAATTCGG CGGGACCTTC AACTACTACA CCAAGGACGA GAACGTGAAT GGCTACGGGC TGCAATCGGG CTCCTACACG TTCGCGGATT GCGTGGATAG CAGTGCTACC GTCACCAGCC CGTATCCCTG CTCCGACACC 
GGCAGCACCG ATCAGGAGTG GGCGAACTTC CTCAACGGCA ACGTGTCGTC GTTCAACCAG ACAAACATTG ACTTCCGCGC GCTGGTACAT CAGCACCAGT GGGAATTCTT TGGTCAGGAT GAGTGGCGTC TCACACCGTA CTTCACACTC AGCTACGGCG TGCGCTACTC GCTCTTCCAG GCGCCCACCT ACGGCAACGG CCTGCTTACG ACCTTCGATC CGTCGAAGTT CGATTCCACC AACACGCCCG CGATCGACAG CAACGGTCTT TACGCGGCCG TGCCATCTGC GCCGTATACC AATGGCATCC TGATCGGCGG CAAGGATTCT CCGTATGGCG ATGCCGTGAA CCGCACGCCG AAACTCAACT TCGCGCCGCG CTTAGGGTTT GCATGGGACC CGACGCATAC CGGCACGACT TCTATCCGCG GCGGATTCGG ACTGTTCTTC GATTCACCTG CCGTGAACAG CATGGAACAG TTCCAGCCGG GAAATCCGCC GTTCGTTACT TCGACCTCAA TTTCGAACAC CAATTTCGAC AATCCGGGTT CAGTACAGGC GGCGCCAAAC CTGTCACCTC CCGACATTGG CGGCATCGCT CCTAACTGGA AGCAGCCGTA CACGATGATG TGGAGCCTGG ACGTGCAGCA CCAGTTCACG CCGTCCACCA TCTTCGACAT TGGCTACTAC GGCAACGCGG GACGCCATCT TATTGGCGTT GTAGACGTAA ACCAAGCGCC ACTCGGCGGC TTCCAAGCCC TCGGCATTCC GGGGCCGGTC AGTTCCGGTG ACACGCAGAA GCTCAATCAG ATCCGTCCGT ACCAGGGCTA TGCGTCAATC GACTTGTTCT CGCCGGTATT TACGTCGAGC TACAACGGCC TGCAGACGTC GTTCACCAAG CACTTCACCG AGAATTCGAT GATCGTGCTG AACTACACCT GGTCGCACGC TCTGGGCACA GCTTCGAGCG ACTACCGTGC GCCGCAGTAT TCCATGGATA TTGGCGCGGA ATACGGCAAC CTCGACTACG ACCGTCGCAA CATGTTCACC GCCAACTATG TGTACGACCT GCCGTTCTTC AAGCACCAGC AGGGGGTTGC GGGACACGTG CTCGGCGGTT GGGAAGTCTC CGGATTGTTC TATGCGTATA GCGGGGCGCA CTACACCGCG AGCGCATCGC GCGATCCCGG CGGCCTCGGC TTGCGTGATC CGAACACCTT CGAGGGCGGG CGCCCCGACC TCATTGGCAA CCCTCAGCAG GGCGCACCGA ACCATCTCGA CAAGTGGTTC AATACCTCAG CGTTTGCGCT CGTGCCAGCC GGCGACGTCC GTGTGGGTAA CGAGCCGCGC GGCGTCATCG TGGGGCCGGG CTACTTCCGT TGGGATGCTT CGCTGTTCAA GAACATCAAG TTCACCGAAC GCTTGAACTT GCAGTTCCGT GCGGAAGCTT TCAACGTGCT CAACCACACG AACTTCAACG CTCCCAACGT CAGTGCGACG AGCTCGCTCT TCGGACAGAT ACTGTCCGCA CGCGATCCTC GGCAGCTACA GCTTGCCCTG AAGTTGACCT TCTAA
|
Protein sequence | MSHWRLRLGA LLLGCLCAGS LFAQEITGDI RGIVKDASGA LVAGATVEVT NTDRNTTIRT VTTDTNGNYV AAYLPVGHYK VSVKKEGFKA AETNNVVLNV HDRLTVDETL QVGSSGQTVT VNENPSQVNL DNATAQGVIT GNQVRQLTLV TRNYEQLVAA LPGVSTNLAS DQLFVGVSNP VGTSNQINFS INGTRPTQNN WQIDGSDNVD RGANLTLLAY PSVDSIQEFN VLRSNYMPEQ GRSSGGQVNV ITRSGTSAFH GSAYEFFRND VLNANNFFNN RADVERPAMR WNDFGFTIGG PIYIPGHYNT EKNKTFFFYS QEWRKIITYN TFTSGVLPTS ANLGGDFGST ICVALNPDGT CAALGNHVST ISPTAQAYIN DIYSKFPAPN NADGTLTWVG RNQFNYREEN VRVDHNFSSK FSIFGRYLDD QIPTQEPGGL FTGLAVPGVA VTNTNAPGRN ASIHATIAFS PTTLADMGYA YSYGAVISSP AGTMASANSP DVNPTLPFGL GPLLPGIGFF NSTQGLAGFG PYNDYNYNHN AFATLTKVIG KHSLKFGGTF NYYTKDENVN GYGLQSGSYT FADCVDSSAT VTSPYPCSDT GSTDQEWANF LNGNVSSFNQ TNIDFRALVH QHQWEFFGQD EWRLTPYFTL SYGVRYSLFQ APTYGNGLLT TFDPSKFDST NTPAIDSNGL YAAVPSAPYT NGILIGGKDS PYGDAVNRTP KLNFAPRLGF AWDPTHTGTT SIRGGFGLFF DSPAVNSMEQ FQPGNPPFVT STSISNTNFD NPGSVQAAPN LSPPDIGGIA PNWKQPYTMM WSLDVQHQFT PSTIFDIGYY GNAGRHLIGV VDVNQAPLGG FQALGIPGPV SSGDTQKLNQ IRPYQGYASI DLFSPVFTSS YNGLQTSFTK HFTENSMIVL NYTWSHALGT ASSDYRAPQY SMDIGAEYGN LDYDRRNMFT ANYVYDLPFF KHQQGVAGHV LGGWEVSGLF YAYSGAHYTA SASRDPGGLG LRDPNTFEGG RPDLIGNPQQ GAPNHLDKWF NTSAFALVPA GDVRVGNEPR GVIVGPGYFR WDASLFKNIK FTERLNLQFR AEAFNVLNHT NFNAPNVSAT SSLFGQILSA RDPRQLQLAL KLTF
|
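The sequences in this record are printed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch, assuming plain Python with no external libraries. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_complete_cds`) are hypothetical and chosen for illustration only. The start codon set is an assumption covering the most common bacterial initiators, not the full list permitted by translation table 11.

```python
def clean_sequence(raw: str) -> str:
    """Strip all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# Assumption: only the most common bacterial start codons are checked here.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(seq: str) -> bool:
    """Check frame length, a common start codon, and a terminal stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in COMMON_START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# The first three blocks of the gene sequence, as printed in the record.
head = clean_sequence("ATGAGCCATT GGCGGCTTCG CCTGGGCGCG")
print(len(head), head[:3])
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_fraction` and `looks_like_complete_cds` would allow the reported GC content and gene length to be compared against the sequence itself.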