Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_3917 |
Symbol | |
ID | 8546313 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 5400600 |
End bp | 5401529 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646388589 |
Product | periplasmic solute binding protein |
Protein accession | YP_003268309 |
Protein GI | 262197100 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.247464 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.1329 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCATA GAATCATCCC CCGCGCCCTG CTCCCCCTGC TCCTGGCCGC GGCCGCGCTG TGGCCGGCCA GCGCCCGCGC CGACGTCGAC ATCGTCGCCA GCGTCCCCGA CCTCGCCGCC CTGGCCCACG CGATCGCCGG CTCCCACGGG AAGGTGCAGG CGCTGTCGCT GCCCACCCAG GATCCCCACT GGGTCGACGC CAAGCCCAGC CTCGCGCTCC AGGTCAACCG CGCCGACCTG CTCATCGCCG TGGGCATGGA GCTCGAGGTC GGCTGGCTGC CCAAGCTGCA GACCGGCTCC CGCAACCCCA AGGTCCAGCG CGGCGCCGCC GGCTTCCTCG AGTGCTCGGC CTTCGTCGAC GCCCTGGAGC GCCCCACCGG CCCGATCGAC CGCAGCATGG GCGACATCCA CCCGGGCGGC AACCCCCACT ACCTGCTCGA CCCGCGCCGC GCCGTCGACT GCGCGCGCGG CATCGCCGAG CGTCTGGCCG AGCTCGACCC GGCCAACGCC GGCGCCTACC GCAAGAACCT GGCGACCTTC ACCGCCGCGC TCGCGGCCAA ACGCACCGAG TGGGAGCGGC GCCTCAGCGC CTACCGCGGC GCGCCCATCG TCACCTATCA CAAGTCCTGG GTGTATCTGT CCGACTGGCT GGGCTTCGAC GAGGTCGGCT ACCTCGAGCC CAAACCCGGC ATCGCGCCGA CCCCGACCCA CGTCGCCCAG CTCATCGCCC GGGCGCGCCT GCGCAAGGTC GGCCTGCTGC TGCAGGAGAG CTACTACCCC AGCAACACCG GCAAGCTGGT CGCCGGCAAG ATCGGCGCGC GCCTGCTGGT CCTGCCCGCG GCCACCAACA CCGCGCGCGG ACAGAGCTAC ATCGCCCACA TCGACGAGCT GGTGAGCGCC ATCGAGCGGG CGCTGGGGAG CAAATCATGA
|
Protein sequence | MMHRIIPRAL LPLLLAAAAL WPASARADVD IVASVPDLAA LAHAIAGSHG KVQALSLPTQ DPHWVDAKPS LALQVNRADL LIAVGMELEV GWLPKLQTGS RNPKVQRGAA GFLECSAFVD ALERPTGPID RSMGDIHPGG NPHYLLDPRR AVDCARGIAE RLAELDPANA GAYRKNLATF TAALAAKRTE WERRLSAYRG APIVTYHKSW VYLSDWLGFD EVGYLEPKPG IAPTPTHVAQ LIARARLRKV GLLLQESYYP SNTGKLVAGK IGARLLVLPA ATNTARGQSY IAHIDELVSA IERALGSKS
|
| |