Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_5151 |
Symbol | |
ID | 8547562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 7094619 |
End bp | 7095821 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646389827 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003269532 |
Protein GI | 262198323 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.385349 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACAGGG TACTGGCTAA ACTTGGGTTC ATCACGGCGA CCGCCATCGC CGTGCTCGCC ACCTCGCTCG GCGGGCAAGC GCTCGCCGAC GAGCAGGGCA CGCTGACCGT GTGGATCAAC GGTGACAAGG GCTATCGCGG TCTCGAGCAG ATCGGCAAGC GCTTCACCAA GGACACCGGC GTCAAGGTCG TGGTCGAGCA CCCCGAGGAT GCTCCCGGCA AGTTCCAGCA GGCGGCCGCC ACCGGCCAGG GCCCCGACAT CTTCTTCTGG GCGCACGATC GCGCCGGCGA GTGGGTGCAG GCCGGCCTCA TCGAGCCGGT CAAGCCCGAC GCCAAGTTCG CGCGCCAGTT CGAGCGCATG GCCTGGGACG CGTGGAAGTT CGGCGGCAAG TACTACGGCT ACCCGGTGGC CATCGAGGCC ATCGCCCTCA TCTACAATAC CGACCTGGTC AAGACCCCGC CCAAGAGCTT CGACCAAGTG GTCGCCCTCA ACGCGCAGCT CTCCAAGCAG GGCAAGAGCG CCATCCTCTG GGACTACAAC AACACCTACT TCACCTGGCC GCTCCTGGCC GCCAACGGCG GCTACGTGTT CAAGCGCCAG GCCAACGGCG ACTACAACGC CAAGGACGTG GGCGTGAACA ACGCCGGCGC GCTCAAGGGC GCCAACCTGC TCCTCGAGCT GATCCAGAAG GGCATCATGC CCAAGGGCGC CGCCTACGAG ACCATGGAGG GCAAGATGCT CAAGGGCGAG CTGGGCATGA TGATCAGCGG CCCCTGGGCC TGGGAGAACC TGCGCAAGAA CAAGATCCCG TTCAGCATCG CGCCCATCCC GTCGATCGCC GGTAAGCCCG CGCGTCCCTT CGTCGGCGTG CTCGGCGCCA TGATCAACCG CTCCAGCAGC GACAAGGATC TGGCCCGTGA GTTCCTCGAG AAGTACGTGC TCAACGCGCG CGGCCTCGAC AACATCAACA GCGCCGTGCC CCTGGGCGTG CCCGCGAACA AGAGCTACTA CCGCCAGCTC GCCAAGAAGG ACCCGCTGGT CAAGCAGACC ATGCTCAGCG CCAAGAACGG CATGCTCATG CCCTCGCACC CCAAGATGGG CAGCTTCTGG TCGGCCATGC AGTCGGCGCT CGAGAACATC ACCAATCAGC GCCAGCCGCC CAAGCAGGCG CTCGACGCCG CCGCTCGCCG CATGGCGAAC TGA
|
Protein sequence | MHRVLAKLGF ITATAIAVLA TSLGGQALAD EQGTLTVWIN GDKGYRGLEQ IGKRFTKDTG VKVVVEHPED APGKFQQAAA TGQGPDIFFW AHDRAGEWVQ AGLIEPVKPD AKFARQFERM AWDAWKFGGK YYGYPVAIEA IALIYNTDLV KTPPKSFDQV VALNAQLSKQ GKSAILWDYN NTYFTWPLLA ANGGYVFKRQ ANGDYNAKDV GVNNAGALKG ANLLLELIQK GIMPKGAAYE TMEGKMLKGE LGMMISGPWA WENLRKNKIP FSIAPIPSIA GKPARPFVGV LGAMINRSSS DKDLAREFLE KYVLNARGLD NINSAVPLGV PANKSYYRQL AKKDPLVKQT MLSAKNGMLM PSHPKMGSFW SAMQSALENI TNQRQPPKQA LDAAARRMAN
|
| |