Gene Hoch_3146 details

Gene Information

Locus tag: Hoch_3146
Symbol:
ID: 8545534
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4324878
End bp: 4326197
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 66%
IMG OID: 646387813
Product: extracellular solute-binding protein family 1
Protein accession: YP_003267541
Protein GI: 262196332
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
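
The coordinates and lengths in this record are mutually consistent, which a few lines of arithmetic can confirm. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only and do not come from the database.

```python
# Values copied from the Gene Information fields.
start_bp = 4324878
end_bp = 4326197
protein_length_aa = 439

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted as an amino acid.
codons = gene_length_bp // 3
coding_codons = codons - 1

print(gene_length_bp)  # 1320
print(coding_codons)   # 439
```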


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.654082
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.268339
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAC TCTTTTCTGT GTTGCTCGCC TGCGCCACGA TCGGCGCCTG GGGCTGCGAT 
AAAAAAGACG AGGCCGCGGG AACGGAAACC GCAGCGACCA CCGAAGAGGC CAAGGAAGTC
ACGATCTCGC TGTCATGCGG CGCGGTCGGC CAGGAGCTCG AGCTGTGCAA GAAGAGCGCA
GAGGAGTGGT CGAAGAAGAC CGGTAACAAG GTCAACATCA TCTCGACGCC CAACGGCTCC
ACCGATCGCC TGGCTCTCTA CCAGCAGATC CTGGGCGCGG CCTCCAATGA CATCGATGTG
TTCCAGATCG ACGTGGTCTG GCCGGGCGTG CTCGCCAGCC ACTTCCTCGA CCTCAAGCCG
CACCTGGGCG GCGCCGAGAG CGAGTTCTTC CCCGCGCTGA TCGAGAACAA CACGGTCGGC
GACAAGCTGG TCGCCCTGCC CTGGTTCACC GACGCGGGCG TGCTCTACTA CCGCAAGGAC
CTGCTCGAGA AGTACGGCGC CGAGCCGCCC ACGACCTGGG CCGAGATGGC CGAGACCGCC
AAGAAGATCC AGGACGGCGA GCGCGAGGCC GGCAACGACG GCATGTGGGG CTACGTGTTC
CAGGGCAAGG CCTACGAGGG CCTCACCTGC AACGGCCTCG AGTGGGTGCA CAGCTTCGGC
GGCGGCACCA TCGTCGACGA GTCGGGCAAG GTCACCATCA ACAACCCGCA GGCCGCGCAG
GCGCTCGACA CCGCCGCCGG CTGGATCGGC ACCATCGCGC CCGAGGGCGT GCTCAACTAC
GCCGAGGAAG AGGCCCGCAG CCTGTTCCAG TCGGGCAACG CGGTGTTCAT GCGCAACTGG
CCCTACGCCT GGGGCATGGC GCAGGCCGAC GACAACATGA AGGACAAGGT CGGCGTGATC
GCGCTGCCCA AGGGCGGCGA CGGCGGCACG CACGCGGCCA CGCTCGGCGG CTGGGGCCTC
GCGGTGTCCA AGTACACCAA GAACGAGGCC GCGGCGGCCG ACCTGGTCAA GCACCTCACC
AGCGCCGAGG TGCAGAAGAT GCGCGCCATC GAGGGTTCCT TCAACCCGAC CATCGACTCG
CTGTACAAAG ACCAGCAGGT TCTCGAGGCC ACGCCGTTTT TCGGCACGCT GTACGAAACC
TTCGCCAACG CTGCGGTGCC GCGCCCGGCG GCGCAGACCG GCTCGAAGTA CAACCAGGTG
TCGAACGCGT TCTGGAACGC GAGCTACGAC GTGCTCTCGG GCAAGACCAA GGCCGCCGAC
AGCCTGGCCG AGCTCGAGAC CAAGCTCAAC GACCTGAGCC GCGGCGGCAG CGCCTGGTAA
 
Protein sequence
MKKLFSVLLA CATIGAWGCD KKDEAAGTET AATTEEAKEV TISLSCGAVG QELELCKKSA 
EEWSKKTGNK VNIISTPNGS TDRLALYQQI LGAASNDIDV FQIDVVWPGV LASHFLDLKP
HLGGAESEFF PALIENNTVG DKLVALPWFT DAGVLYYRKD LLEKYGAEPP TTWAEMAETA
KKIQDGEREA GNDGMWGYVF QGKAYEGLTC NGLEWVHSFG GGTIVDESGK VTINNPQAAQ
ALDTAAGWIG TIAPEGVLNY AEEEARSLFQ SGNAVFMRNW PYAWGMAQAD DNMKDKVGVI
ALPKGGDGGT HAATLGGWGL AVSKYTKNEA AAADLVKHLT SAEVQKMRAI EGSFNPTIDS
LYKDQQVLEA TPFFGTLYET FANAAVPRPA AQTGSKYNQV SNAFWNASYD VLSGKTKAAD
SLAELETKLN DLSRGGSAW
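
The gene and protein sequences can be cross-checked against each other and against the listed GC content. The following is a minimal sketch, not the database's own method. It assumes the sequences are pasted in with the 10-character spacing shown in this record. It uses the amino acid assignments of translation table 11, which match the standard code. The alternative start codons of table 11 are not handled, since this gene begins with ATG. The names `clean`, `gc_fraction` and `translate` are hypothetical helpers introduced for illustration.

```python
# Codon table in TCAG order; amino acid assignments of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        a, b, c = (BASES.index(base) for base in seq[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * a + 4 * b + c]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAAGAAAC TCTTTTCTGT GTTGCTCGCC TGCGCCACGA TCGGCGCCTG GGGCTGCGAT"

print(translate(first_line))              # MKKLFSVLLACATIGAWGCD
print(round(gc_fraction(first_line), 3))  # 0.583
```

Passing the full gene sequence to `translate` should reproduce the protein sequence above, and `gc_fraction` on the full sequence can be compared with the listed GC content of 66%.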