Gene Information |
Locus tag | Acid345_0156 |
Symbol | |
ID | 4070068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 163915 |
End bp | 165216 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637982156 |
Product | HipA-like protein |
Protein accession | YP_589235 |
Protein GI | 94967187 |
COG category | [R] General function prediction only |
COG ID | [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes |
TIGRFAM ID | [TIGR03071] HipA N-terminal domain |
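The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS includes one terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(163915, 165216)
print(length)                     # 1302
print(protein_length_aa(length))  # 433
```

Under these assumptions both values agree with the Gene Length and Protein Length rows of the record.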
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0383905 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00201616 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCCGTC ACGCTACCTA TGCCCCTCTA AACGTTTTCA TCAACTCGCG ACGGGTCGGA GTTCTCCGTC GCGAGTCCTC AGGCGCCGTC GAATTTCGCT ACAGCGACGA GTGGCTCGCG TGGGCGCACG CATTTCCGAT CTCACTATCT CTGCCTATGC GTCCGGAGAA ATACGTTGGC GCTCCGGTTC TCGCAGTCCT GGAAAATCTA CTGCCGGACA ATGAAGCGAT CCTGCGACGA GTCGCTGAGC GGGTGCACGC GAACGGAACC GACGCTTATA ACTTGCTTTC AGCAATCGGA CGCGACTGCG TCGGGGCACT GCAGTTTCGA CCGGAAGACA CACAACCTGG CCCTGCAGGA GTCGTTGATG GAGAGGAACT CGAGGGTCAC GGAATTGAGC AACTACTCGC AAACCTAACG CGCGCGCCTC TAGGCCTTAC GACCGACGAT GATTTCAGGA TCTCTATCGC CGGAGCCCAA GAAAAGACTG CTCTTCTCCG CTGGAATGAT CGATGGTGGA AACCGCTTGG CGCCACGGCG ACCACACATA TTTTCAAGCG GCCGATAGGA ATGGCGCACA ATATTGACCT CAGCGATAGC GTGGAGAATG AGTATCTCTG TCTCCGACTC ACGAAGGCAT TGGGAATTCC TGTTGCGAAT GCGGAAATCG AGCAGTTTGG AAAACAGAAG GCATTGGTGA TTGAGCGCTT CGATCGTCTT TGGACGAGCG ACAAGAGGTT GTTGCGCATT CCGCAAGAAG ACTGCTGCCA GGCATTCTCG ATGCCACCAA CCAAAAAATA CGAAGCCGAG GGCGGTCCCG GAGCCGTGAA AATTCTTACC CTCCTCAATG CTAGTGACAC GCCTACCGAC GACCAGAAGA TGTTCTTGAA AGCACTGATA ACTTTTTGGC TACTCGGAGC GATCGACGGC CATGCAAAGA ATTTCAGTAT TCGAATCGCC GAAGGCGGAC GTTTTCAATT GTCTCCCCTT TACGACATCG TCTCCGGCCA GCCGGCTGTG GCAGCGGGCA GCATCCGCCA CAACCAATTC AAACTAGCGA TGGCGGTTGG GAAGAATCGG CAGTACGTGG TGAACTCAAT CGCACCCCGG CACTTCGCCG AAACGGCTGC ACAGGCGGGC GTCGGGCGAG TGGTCGTGGA GGAAGCGATG CAGGAGTTAC ATTCGGCGGC GGTGAAGAAT GTGAGCCACG TTTTCAAAGA CCTTCCTTCC AAGTTTCCAA TGAAAATGGC AGAAGCGGTC CAGAAAGGCT TCGACCGGCG CCTCAAGATG CTGAGTGCAT GA
|
Protein sequence | MPRHATYAPL NVFINSRRVG VLRRESSGAV EFRYSDEWLA WAHAFPISLS LPMRPEKYVG APVLAVLENL LPDNEAILRR VAERVHANGT DAYNLLSAIG RDCVGALQFR PEDTQPGPAG VVDGEELEGH GIEQLLANLT RAPLGLTTDD DFRISIAGAQ EKTALLRWND RWWKPLGATA TTHIFKRPIG MAHNIDLSDS VENEYLCLRL TKALGIPVAN AEIEQFGKQK ALVIERFDRL WTSDKRLLRI PQEDCCQAFS MPPTKKYEAE GGPGAVKILT LLNASDTPTD DQKMFLKALI TFWLLGAIDG HAKNFSIRIA EGGRFQLSPL YDIVSGQPAV AAGSIRHNQF KLAMAVGKNR QYVVNSIAPR HFAETAAQAG VGRVVVEEAM QELHSAAVKN VSHVFKDLPS KFPMKMAEAV QKGFDRRLKM LSA
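The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a rough sketch of how the GC content and the translation could be checked with the Python standard library only. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs in its permitted start codons); the helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block-of-10 spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGCCCCGTC ACGCTACCTA TGCCCCTCTA"
print(translate(prefix))  # MPRHATYAPL
```

Passing the full Gene sequence value to `translate` and comparing the result with the cleaned Protein sequence value would be one way to confirm that the two fields are consistent; `gc_fraction` on the full sequence can be compared with the GC content row in the same way.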
|
| |