Gene Information |
Locus tag | Acid345_2071 |
Symbol | |
ID | 4069922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2482142 |
End bp | 2484031 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984086 |
Product | TonB-dependent receptor |
Protein accession | YP_591146 |
Protein GI | 94969098 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | |
|
|
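The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes the start and end coordinates are 1-based and inclusive, and that the protein length excludes the stop codon; the variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2482142
end_bp = 2484031

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1890, matching "Gene Length"
print(protein_length)   # 629, matching "Protein Length"
```

Under these assumptions the computed values agree with the reported Gene Length (1890 bp) and Protein Length (629 aa).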
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00162344 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000156466 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGAAGTT CGATTGTACT GTTATTTCTT TGCATTTCTT TAGGTGCTTT GGCGCAGCAA GGGCCTACCC CTCCGCAAAC GGCAGCAGGC GGACCTCCAC CCACGCAGCC GCAGCCCGAC GAAGTGGTGG TCGTGACCGG AACCTGGGAG CCGATGCCGC TCGAAGATCT CCAACGCTCG GTGCAGTCCG TTGATGTGCA GGCTGCTCCG CTCTTGTTCT CGAGCACTGC GCAGTTTCTG CAACTCGATC CGTCGGTAGA TGTGCGGCAG CGCGCTCCGG GCGGTGACCA GGCAGACCTG TCGATCCGGG GCTCGGCCTT CGAGCAATCG CTGGTGCTGA TTGACGGTCT CCGCGTGAAT GACGCTCAGA CCGGCCACCA CAACCTCGAC CTCCCGATTC CGCTCGACAC TATCAGCCGC ATCGAAGTCC TGCATGGCGC GGGTTCGACG TTCTATGGCG CCGATGCGCT GGGCGGCGCG GTGAACTTTA TCACCGCTCC GGCTGCGACC AGCGAACTAC GCTTGCGTGC GGGTTTCGGC AACTTCGGCT ACAACGAGCA GCGTGCCGTC GCTTCCCACG CGACGAAGAA CTTCAGCGAG CAGCTCATCG GCGATCGCAG CTTCTCGACG GGCTTCATAG AAGACCGCGA CTTCCGGAAC GCGGCCGTGT CGAGCGAGAC CCATTTCCAT ACCGCGCTCG GCGACACAAT GTTTCTCCTT GCGACCTCCG ACCGACCCTA CGGAGCGAAC CAGTTTTACG GTCCGTTCGA TTCGTGGGAG CGAACCAAGG CGTGGTTCGT GGCGTGGACA CAGGACCTCG GCAAGCAGAC GGCTTTCGAC TTTGGTTACC GCCGCCACAC CGATGAGTTT GTGCTGCTCC GCGAGGCACC CAGCGTTTAT GAGAACAACC ATGTGACCGA TAGCTGGCAG GGTGCCCTTC GCCGTCACGA CGAAATTGGC AAGGTCACGA CGATTTCTTA TGGCGCGGAA GGCTATCGCG ATCAGATCGA CAGCAACAAT CTCGGATATC ACGGCCGCAA TCGCGGCGCA GTGTATGCCG CCGCCGATTT CCGAATGATC AAGCGCTTCT CGCTCTCCGT GGGCGCTCGC GAGGAGTCCT ACAACGGGAC CAAGGGACAG TTCACACCGT CGGTGAGTGC GGCGTACTGG TTCGCGCCGT CATTCAAAGT AAGGGGCGCA GTGAGCCGCG GCTTCCGTAT TCCAACCTAT ACCGATCTTT ATTACAGCGA TCCCGCCAAT GCAGGAAACC CTAACCTTCG TCCGGAGTCG GCGTGGAGCT ACGAAGGCGG CGTCGATTGG AATGCGGGCG GCAAGATAGC TCTGACGGCG ACAGTATTCC ACCGCCGCGA GCATGACGGC ATTGACTACG TGAAGTGCGG CTCCGGCTTT ACCTTCGACA TCAATACCGG CACCTGCATC GCAAGCGGAG TACCGAACGA CGTTTGGCAT GCCTACAACA TCGACAGCCT GAACTTCACC GGCTTCGAGA CCCTTCTTCG CTATCGTCTT ACGCAGCGCC AGGAGTTCAC CGTGGGTTAT ACCGGCATTC ACGGTTCGCA GAATGCCGCA CCCCGTGTGC AGTCGCAGTA CGTCTTCAAC TATCCCGTGA ACAATACTTA CGTAGGATGG CAGGGAAGTG TGTGGCGAGG GATCATCGCG CGGACGCGTC TCGGCGTGAC CCAACGCTAC GCGCACGATC CCTATGCCCT TTGGGACTTC TCTGTAGCGA GGGAAGAGGG ACGTATTCGG CCCTACCTGC AGTTCACAAA TCTAACCAGC 
ACGACCTATC AGGAAGTCGA TGGCGTCGCG ATGCCGGAGT TCGGCGTGAT CGGTGGCGTA GAGATCGCGG TCTTCGGCAA GAAGCGTTAA
|
Protein sequence | MRSSIVLLFL CISLGALAQQ GPTPPQTAAG GPPPTQPQPD EVVVVTGTWE PMPLEDLQRS VQSVDVQAAP LLFSSTAQFL QLDPSVDVRQ RAPGGDQADL SIRGSAFEQS LVLIDGLRVN DAQTGHHNLD LPIPLDTISR IEVLHGAGST FYGADALGGA VNFITAPAAT SELRLRAGFG NFGYNEQRAV ASHATKNFSE QLIGDRSFST GFIEDRDFRN AAVSSETHFH TALGDTMFLL ATSDRPYGAN QFYGPFDSWE RTKAWFVAWT QDLGKQTAFD FGYRRHTDEF VLLREAPSVY ENNHVTDSWQ GALRRHDEIG KVTTISYGAE GYRDQIDSNN LGYHGRNRGA VYAAADFRMI KRFSLSVGAR EESYNGTKGQ FTPSVSAAYW FAPSFKVRGA VSRGFRIPTY TDLYYSDPAN AGNPNLRPES AWSYEGGVDW NAGGKIALTA TVFHRREHDG IDYVKCGSGF TFDINTGTCI ASGVPNDVWH AYNIDSLNFT GFETLLRYRL TQRQEFTVGY TGIHGSQNAA PRVQSQYVFN YPVNNTYVGW QGSVWRGIIA RTRLGVTQRY AHDPYALWDF SVAREEGRIR PYLQFTNLTS TTYQEVDGVA MPEFGVIGGV EIAVFGKKR
|
| |
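The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a field might be cleaned and checked against the protein sequence, the code below strips the whitespace, computes a GC fraction, and translates codons. It builds the codon map by hand on the assumption that translation table 11 assigns amino acids to codons in the same way as the standard code; the function names (`clean`, `gc_fraction`, `translate`) are hypothetical helpers, not part of any database API.

```python
# Codon map in TCAG order. Assumption: translation table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence (0.0 for an empty input)."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s) if s else 0.0


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon; unknown codons become 'X'."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence shown above.
gene_head = "ATGAGAAGTT CGATTGTACT GTTATTTCTT"
gene_tail = "AA GAAGCGTTAA"

print(translate(gene_head))  # MRSSIVLLFL
print(translate(gene_tail))  # KKR
```

The two fragments translate to the first ten and last three residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the reported GC content.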