Gene Hoch_5151 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5151 
Symbol 
ID8547562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7094619 
End bp7095821 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content66% 
IMG OID646389827 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003269532 
Protein GI262198323 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.385349 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACAGGG TACTGGCTAA ACTTGGGTTC ATCACGGCGA CCGCCATCGC CGTGCTCGCC 
ACCTCGCTCG GCGGGCAAGC GCTCGCCGAC GAGCAGGGCA CGCTGACCGT GTGGATCAAC
GGTGACAAGG GCTATCGCGG TCTCGAGCAG ATCGGCAAGC GCTTCACCAA GGACACCGGC
GTCAAGGTCG TGGTCGAGCA CCCCGAGGAT GCTCCCGGCA AGTTCCAGCA GGCGGCCGCC
ACCGGCCAGG GCCCCGACAT CTTCTTCTGG GCGCACGATC GCGCCGGCGA GTGGGTGCAG
GCCGGCCTCA TCGAGCCGGT CAAGCCCGAC GCCAAGTTCG CGCGCCAGTT CGAGCGCATG
GCCTGGGACG CGTGGAAGTT CGGCGGCAAG TACTACGGCT ACCCGGTGGC CATCGAGGCC
ATCGCCCTCA TCTACAATAC CGACCTGGTC AAGACCCCGC CCAAGAGCTT CGACCAAGTG
GTCGCCCTCA ACGCGCAGCT CTCCAAGCAG GGCAAGAGCG CCATCCTCTG GGACTACAAC
AACACCTACT TCACCTGGCC GCTCCTGGCC GCCAACGGCG GCTACGTGTT CAAGCGCCAG
GCCAACGGCG ACTACAACGC CAAGGACGTG GGCGTGAACA ACGCCGGCGC GCTCAAGGGC
GCCAACCTGC TCCTCGAGCT GATCCAGAAG GGCATCATGC CCAAGGGCGC CGCCTACGAG
ACCATGGAGG GCAAGATGCT CAAGGGCGAG CTGGGCATGA TGATCAGCGG CCCCTGGGCC
TGGGAGAACC TGCGCAAGAA CAAGATCCCG TTCAGCATCG CGCCCATCCC GTCGATCGCC
GGTAAGCCCG CGCGTCCCTT CGTCGGCGTG CTCGGCGCCA TGATCAACCG CTCCAGCAGC
GACAAGGATC TGGCCCGTGA GTTCCTCGAG AAGTACGTGC TCAACGCGCG CGGCCTCGAC
AACATCAACA GCGCCGTGCC CCTGGGCGTG CCCGCGAACA AGAGCTACTA CCGCCAGCTC
GCCAAGAAGG ACCCGCTGGT CAAGCAGACC ATGCTCAGCG CCAAGAACGG CATGCTCATG
CCCTCGCACC CCAAGATGGG CAGCTTCTGG TCGGCCATGC AGTCGGCGCT CGAGAACATC
ACCAATCAGC GCCAGCCGCC CAAGCAGGCG CTCGACGCCG CCGCTCGCCG CATGGCGAAC
TGA
 
Protein sequence
MHRVLAKLGF ITATAIAVLA TSLGGQALAD EQGTLTVWIN GDKGYRGLEQ IGKRFTKDTG 
VKVVVEHPED APGKFQQAAA TGQGPDIFFW AHDRAGEWVQ AGLIEPVKPD AKFARQFERM
AWDAWKFGGK YYGYPVAIEA IALIYNTDLV KTPPKSFDQV VALNAQLSKQ GKSAILWDYN
NTYFTWPLLA ANGGYVFKRQ ANGDYNAKDV GVNNAGALKG ANLLLELIQK GIMPKGAAYE
TMEGKMLKGE LGMMISGPWA WENLRKNKIP FSIAPIPSIA GKPARPFVGV LGAMINRSSS
DKDLAREFLE KYVLNARGLD NINSAVPLGV PANKSYYRQL AKKDPLVKQT MLSAKNGMLM
PSHPKMGSFW SAMQSALENI TNQRQPPKQA LDAAARRMAN