Gene Hoch_2203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2203 
Symbol 
ID8544589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3061512 
End bp3063107 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content71% 
IMG OID646386910 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003266641 
Protein GI262195432 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0247328 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCCCTGG CGGCCGCGAT AGCGCTGCTC GGCGCCTGCG AGCGGCGCGC GCGGCGCACG 
CCCGACGACA CCCTGGTGGT GCTGGTGCCG ACCGCCATGG GCGAGATCGA TCCCCGCTTC
GTGGTCGGCA GCAACGACAC CAAGCTGTCG CGCCTCATCG CGCCCGGCCT CACCAGCATC
GAGCGGCACT CGCTCGAGCC CCAGCCGCTC CTGGCCGAGC GCATCGAGCA GCGCGACGAG
CTCACCTGGG ACGTGTATCT GCGCCGCGAT GCGCGCTTCT CCGACGGCAG CCCGGTGACC
GCGGCCGACG TCGCCTACAG CTACAATTCG GTGCTCGACC CGGCCACGGG CAGCCTCTAC
CGCCAGGGCT TCGAGACCCG CTACGAGCGC ATCGAGGCCG TGGACGAGCA CCACGTGCGT
TTTCACCTCG ACGCGCCGCT GGCCACCTTT CTCTCCGACA TCGAGTTCGG CATCGTGTCG
CAGCGCGCGG CCCAGGCCGG CGCCGCCGCG GGCGCGCCCC CGGGCCACTT CGCCGACGGC
CTGGTCATCG GCGCCGGCGC CTACTCGCCC ACCCTGGTCG CCAGCGAGCG CGTCGAGCTG
AGCCGCAACC CACACTATTT CGGACAGCCG GCCAAGCTCG AACACGTGGT CGTCCGCACC
GTGCGCGACG CCAACGCCCG CGCGCTCATG CTGGTCGGCG GCTCGGCCGA CCTGGCGCAG
AACGCCATCC GCCTCGATCT CGTGGACGCG GTCGACGAGC GCGAGCGCGT GCGCGTGGAC
AGCGGCCCCA GCGCCATCCT CTCGTACCTC ATGATGCAGA ACCGCGACCC CGTGCTCGCC
GACCTGCGCG TGCGCCGCGC CATCGCCTAC GCCATCGACC GCGAGCGCAT CATCGACGTC
AAATTCGGCG GCCGCGCGGA GCTGGCCTCG GGCCTCTTGC CGCCCGCGCA CTGGGCCTAC
GAGCCCGATG TAGCGCGCTA CGGCTACGAC CCCGCGCGCG CTCAGGCGCT GCTCGACGAG
GCCGGCTACC CCGACCCCGA CGGCCCCGGC GGCCAGCCTC GGCTGCGGCT ATCGTACAAG
ACCAGCGCCG ACCAGTTCCG GCTGTCGATC GCGCGCATCA TCGCCGCGCA GCTCGCCGAG
GTCGGCATCG AGGTCGACGT CCGGGCGTTC GAATTCGGCA CCTTCTTCGC CGACATCAAG
GCCGGCAACT ACCAGATCGC CACCATGCAG ACCGCGGCCA TCTCCGAGCC CGACTACTAC
TACGCGTACT TCCACTCCTC GCGCATCCCG ACCGACGAGG ATCCGCACCT CACCAACCGT
TGGCGCTACG AAAACCCGCG CGTCGACACC CTCACCGAGG AGGGCCGCAG CATCGCCGAG
CGCGAGCAGC GGCTGGTGCG CTACCGCGAG GTGCAGAAGA TCCTGGCCGA TGAGCTGCCC
GTGGTGCCGC TGTGGCACGA GGACAACATC GCGGTCATGA ACATCGAGGT CGAGGGCTTC
GAGATCCTGC CGCACGCCAG CTTGAGCGGC CTGGTGGCCA CCGACAAGCG GCGCGCGAGC
GGGTCGCCGG CGCGCGCTCG CGGCGCTGAC GAGTAG
 
Protein sequence
MALAAAIALL GACERRARRT PDDTLVVLVP TAMGEIDPRF VVGSNDTKLS RLIAPGLTSI 
ERHSLEPQPL LAERIEQRDE LTWDVYLRRD ARFSDGSPVT AADVAYSYNS VLDPATGSLY
RQGFETRYER IEAVDEHHVR FHLDAPLATF LSDIEFGIVS QRAAQAGAAA GAPPGHFADG
LVIGAGAYSP TLVASERVEL SRNPHYFGQP AKLEHVVVRT VRDANARALM LVGGSADLAQ
NAIRLDLVDA VDERERVRVD SGPSAILSYL MMQNRDPVLA DLRVRRAIAY AIDRERIIDV
KFGGRAELAS GLLPPAHWAY EPDVARYGYD PARAQALLDE AGYPDPDGPG GQPRLRLSYK
TSADQFRLSI ARIIAAQLAE VGIEVDVRAF EFGTFFADIK AGNYQIATMQ TAAISEPDYY
YAYFHSSRIP TDEDPHLTNR WRYENPRVDT LTEEGRSIAE REQRLVRYRE VQKILADELP
VVPLWHEDNI AVMNIEVEGF EILPHASLSG LVATDKRRAS GSPARARGAD E