Gene Hoch_5439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5439 
Symbol 
ID8547852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7463732 
End bp7465513 
Gene Length1782 bp 
Protein Length593 aa 
Translation table11 
GC content75% 
IMG OID646390112 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003269815 
Protein GI262198606 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.157729 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTAG GCATGCTGTT CACCGGGGCG GCGCTGATCG CGCTGGCGTT CACCTGTGAG 
GGCCCGAGCG CGCGCTCCAA GCCCTGGCGC CACGCCGAGC TCGACCTCGC CGAGGTCGCG
CCCTCCTCGG AGCTGATCGC GGCCGAGGCC GCGCGCGTGG CCTCGCGGCG GCGCCGCGGC
AGCACCCTGC GCATCCTGCT CGACGCCAAC CCGCGGCACC TCAACCCGCT GATCGCGCCG
ACCACGTGGA CCCGGCGCAT CGCCATGGAC ACGGTGTTCG AGAGCCTGGT GCGCTACGAG
CCGCCGAGCG GCGGCGCCGG CACCGGGCCC GGGCGCTACC GGCCCGGGCT GGCGCGCTCG
TGGACGCTGT CGCAGGGCGG GCGCGAGCTG CGCCTCGAGC TGAGCCCCGA GGTCACCTTC
CACGACGGCT CGCGCATGTC GTCGGTGGAC GTGCAGTTCT CGCTCGACAC CGCGCGCTCG
CCCAAGTACG CGGCCGATCA CCTGCGGCCC ATGCTGGCCG ATGTCCTGGC GGTCGAGATC
CTCGGCCCGC GCACGGTGCG CGTGCGTCTG TCCCGGCCCA ACGGCTACGT GCTGCGCGCC
CTGGCCGAGA TCCCCATCCT GCCCGCCCAG GTGTACGAAA AACGCCTGCG GGCGGCGCGC
GGACCCGTGG TCGGCACCGG CCCGTATCGA CTCGAGAGCT GGGACGACGA GCTGATACGG
CTCGAGCGCT CTGACAGCTA TTGGGGCGCG GCCCCGGCCA TCCCGCGCAT TGAATTCGTC
CACCAGCCCG ACGACGCCCG CGCGCTCACC GAGGCCAAGC GCGGCGAGCT CGACATCGTG
CCGGCCCTGA TCCCGTCGCA CTATCCCGAG CAGGCCTCGG CGCCCGGGCT GGCGCGCGTG
TTCCGGCCGC TGCGCCTGCG CCCGCCGGCG CTGCGCTACA TGGTCATGAA CACGGCCAAC
CCGGCCCTGA GCGACGTGCG CGTGCGCCAG GCGTTGGCGC TGCTGATCGA CCGCAAGGCG
CTGATCAAGA GCGAGCACGA CGGCCTGGCG CGAGCCGTGG CCGGCTTCGT GTGGCCCGGC
GGCCCCGGTG ACGGACCGGC GCCGAGCCCG CCCGACTACG ATCCTGCGCG CGCCGGTGCC
CTGCTCGACG CCGCGGGCTG GCGCGATCGC GACGGCGACG GCGTGCGCGA GCTGGGCGAG
GACAAGCTGC GGCTCACGCT GCTGGTCACC GACAGCGGCG AGCCCAGCGA CGACGACGAC
GCCGAGGAGG CCCGCGACCG CGGCGAGCGC GAGCGTATCG TGCGCAGCCT GCGCCGCTCG
GGCGTGCAGA TCGACCGCCG CGTGGGGCCG CCCGCCGTGC TGCGCAACCG CCTGCGCGCG
GGCGACTTCG ACCTGGCCTT CGCGCACTGG CGCGGCCTGG TCGACGACGA CCTGGCGCCG
CTGCTCGAGA GCGGCTCATC GCTCAACCTC GGCGGCTTCT CGAGCCCGCG GGTCGACGCT
GTGCTCGCCG CTCTGCGCGC GGCCTGGGAG CCGGGCGCGC GGGCGCCGCG CATGGCCGAG
CTGGCCCAGG TGCTGGGTGA GACCTGGCCG GTGGCTGCGA TCGTCGCCCC CGATCCCTAC
GGGCTCATCC ACCGCCGCGT GCGCGGCGCC GTGGTGTGGA ATGGCTGGCT GGTGCTGCGC
TCGCTGTCGC TCGACCCCGA TCGCGACGAC GAAGGCGAAG CCGCGGGCGC AGCCGCGGGC
GCTGGCTCGG GCGCCGCGGA CCCGAGCGGC GAGGCGCCAT AG
 
Protein sequence
MLLGMLFTGA ALIALAFTCE GPSARSKPWR HAELDLAEVA PSSELIAAEA ARVASRRRRG 
STLRILLDAN PRHLNPLIAP TTWTRRIAMD TVFESLVRYE PPSGGAGTGP GRYRPGLARS
WTLSQGGREL RLELSPEVTF HDGSRMSSVD VQFSLDTARS PKYAADHLRP MLADVLAVEI
LGPRTVRVRL SRPNGYVLRA LAEIPILPAQ VYEKRLRAAR GPVVGTGPYR LESWDDELIR
LERSDSYWGA APAIPRIEFV HQPDDARALT EAKRGELDIV PALIPSHYPE QASAPGLARV
FRPLRLRPPA LRYMVMNTAN PALSDVRVRQ ALALLIDRKA LIKSEHDGLA RAVAGFVWPG
GPGDGPAPSP PDYDPARAGA LLDAAGWRDR DGDGVRELGE DKLRLTLLVT DSGEPSDDDD
AEEARDRGER ERIVRSLRRS GVQIDRRVGP PAVLRNRLRA GDFDLAFAHW RGLVDDDLAP
LLESGSSLNL GGFSSPRVDA VLAALRAAWE PGARAPRMAE LAQVLGETWP VAAIVAPDPY
GLIHRRVRGA VVWNGWLVLR SLSLDPDRDD EGEAAGAAAG AGSGAADPSG EAP