Gene Information |
Locus tag | Hoch_2203 |
Symbol | |
ID | 8544589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 3061512 |
End bp | 3063107 |
Gene Length | 1596 bp |
Protein Length | 531 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646386910 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003266641 |
Protein GI | 262195432 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
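The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAG). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(3061512, 3063107)
print(length_bp)                  # 1596
print(protein_length(length_bp))  # 531
```

Both results match the Gene Length (1596 bp) and Protein Length (531 aa) fields.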
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0247328 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCCCTGG CGGCCGCGAT AGCGCTGCTC GGCGCCTGCG AGCGGCGCGC GCGGCGCACG CCCGACGACA CCCTGGTGGT GCTGGTGCCG ACCGCCATGG GCGAGATCGA TCCCCGCTTC GTGGTCGGCA GCAACGACAC CAAGCTGTCG CGCCTCATCG CGCCCGGCCT CACCAGCATC GAGCGGCACT CGCTCGAGCC CCAGCCGCTC CTGGCCGAGC GCATCGAGCA GCGCGACGAG CTCACCTGGG ACGTGTATCT GCGCCGCGAT GCGCGCTTCT CCGACGGCAG CCCGGTGACC GCGGCCGACG TCGCCTACAG CTACAATTCG GTGCTCGACC CGGCCACGGG CAGCCTCTAC CGCCAGGGCT TCGAGACCCG CTACGAGCGC ATCGAGGCCG TGGACGAGCA CCACGTGCGT TTTCACCTCG ACGCGCCGCT GGCCACCTTT CTCTCCGACA TCGAGTTCGG CATCGTGTCG CAGCGCGCGG CCCAGGCCGG CGCCGCCGCG GGCGCGCCCC CGGGCCACTT CGCCGACGGC CTGGTCATCG GCGCCGGCGC CTACTCGCCC ACCCTGGTCG CCAGCGAGCG CGTCGAGCTG AGCCGCAACC CACACTATTT CGGACAGCCG GCCAAGCTCG AACACGTGGT CGTCCGCACC GTGCGCGACG CCAACGCCCG CGCGCTCATG CTGGTCGGCG GCTCGGCCGA CCTGGCGCAG AACGCCATCC GCCTCGATCT CGTGGACGCG GTCGACGAGC GCGAGCGCGT GCGCGTGGAC AGCGGCCCCA GCGCCATCCT CTCGTACCTC ATGATGCAGA ACCGCGACCC CGTGCTCGCC GACCTGCGCG TGCGCCGCGC CATCGCCTAC GCCATCGACC GCGAGCGCAT CATCGACGTC AAATTCGGCG GCCGCGCGGA GCTGGCCTCG GGCCTCTTGC CGCCCGCGCA CTGGGCCTAC GAGCCCGATG TAGCGCGCTA CGGCTACGAC CCCGCGCGCG CTCAGGCGCT GCTCGACGAG GCCGGCTACC CCGACCCCGA CGGCCCCGGC GGCCAGCCTC GGCTGCGGCT ATCGTACAAG ACCAGCGCCG ACCAGTTCCG GCTGTCGATC GCGCGCATCA TCGCCGCGCA GCTCGCCGAG GTCGGCATCG AGGTCGACGT CCGGGCGTTC GAATTCGGCA CCTTCTTCGC CGACATCAAG GCCGGCAACT ACCAGATCGC CACCATGCAG ACCGCGGCCA TCTCCGAGCC CGACTACTAC TACGCGTACT TCCACTCCTC GCGCATCCCG ACCGACGAGG ATCCGCACCT CACCAACCGT TGGCGCTACG AAAACCCGCG CGTCGACACC CTCACCGAGG AGGGCCGCAG CATCGCCGAG CGCGAGCAGC GGCTGGTGCG CTACCGCGAG GTGCAGAAGA TCCTGGCCGA TGAGCTGCCC GTGGTGCCGC TGTGGCACGA GGACAACATC GCGGTCATGA ACATCGAGGT CGAGGGCTTC GAGATCCTGC CGCACGCCAG CTTGAGCGGC CTGGTGGCCA CCGACAAGCG GCGCGCGAGC GGGTCGCCGG CGCGCGCTCG CGGCGCTGAC GAGTAG
|
Protein sequence | MALAAAIALL GACERRARRT PDDTLVVLVP TAMGEIDPRF VVGSNDTKLS RLIAPGLTSI ERHSLEPQPL LAERIEQRDE LTWDVYLRRD ARFSDGSPVT AADVAYSYNS VLDPATGSLY RQGFETRYER IEAVDEHHVR FHLDAPLATF LSDIEFGIVS QRAAQAGAAA GAPPGHFADG LVIGAGAYSP TLVASERVEL SRNPHYFGQP AKLEHVVVRT VRDANARALM LVGGSADLAQ NAIRLDLVDA VDERERVRVD SGPSAILSYL MMQNRDPVLA DLRVRRAIAY AIDRERIIDV KFGGRAELAS GLLPPAHWAY EPDVARYGYD PARAQALLDE AGYPDPDGPG GQPRLRLSYK TSADQFRLSI ARIIAAQLAE VGIEVDVRAF EFGTFFADIK AGNYQIATMQ TAAISEPDYY YAYFHSSRIP TDEDPHLTNR WRYENPRVDT LTEEGRSIAE REQRLVRYRE VQKILADELP VVPLWHEDNI AVMNIEVEGF EILPHASLSG LVATDKRRAS GSPARARGAD E
|
| |
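The gene sequence begins with TTG while the protein begins with M. Under translation table 11, TTG is an accepted start codon and is read as methionine at the initiation position, even though it codes for leucine elsewhere. The sketch below illustrates this on the first 30 bases of the gene sequence and also shows one way to compute GC content from the space-grouped sequence text. The codon dictionary is deliberately partial (only the codons in this excerpt), and the helper names are hypothetical.

```python
# Start codons listed for NCBI translation table 11.
TABLE11_STARTS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

# Partial codon table: only the codons needed for the excerpt below.
CODONS = {"TTG": "L", "GCC": "A", "CTG": "L", "GCG": "A", "ATA": "I", "CTC": "L"}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    bases = clean(seq)
    return sum(base in "GC" for base in bases) / len(bases)


def translate_prefix(seq: str) -> str:
    """Translate complete codons; a valid start codon in first position becomes M."""
    bases = clean(seq)
    usable = len(bases) - len(bases) % 3
    residues = []
    for i in range(0, usable, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in TABLE11_STARTS:
            residues.append("M")
        else:
            residues.append(CODONS[codon])
    return "".join(residues)


# First three blocks of the gene sequence in this record.
prefix = "TTGGCCCTGG CGGCCGCGAT AGCGCTGCTC"
print(translate_prefix(prefix))  # MALAAAIALL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.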