Gene Hoch_1485 details


Gene Information

Locus tag: Hoch_1485
Symbol:
ID: 8543867
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2016242
End bp: 2017981
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 67%
IMG OID: 646386196
Product: extracellular solute-binding protein family 5
Protein accession: YP_003265931
Protein GI: 262194722
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
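
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon; the reported values are consistent with both assumptions, but the record itself does not state them. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(2016242, 2017981)
print(length, protein_length_aa(length))  # prints: 1740 579
```

Both results match the Gene Length and Protein Length fields listed in this record.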


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.522952
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGAGAA GGACCTTGTT CTCGGACACC CGAGCGAAAT CGCTTGCCGC GAGCGTGCTG 
GCCGCCAGCG TGCTCGCCAC CGGCGGCCTG AGCGCTTGCA AAGACTCGCC CAACGCGCCC
AAGGCGTCCG ACAAAAGCGC GGCCAACGGC GAGCCCGCCG CGGCCGCCGC CGTGCTCGCG
GTCGGCATGC CCAACGGCCC GCAGACCGAG AACCACAACC CCTTCCTCAC CACCTCGGCC
GCCGCGTCGC TCGGCTATCG CTGGATGATG TACGAGCCGC TGATGATGTG GAATCCGGTC
AAGCCGGCCG AGCCCTCCAA GCCCTGGCTG GCCGCCAGCG TCAAGTGGTC GCCCGATTTC
AAGAGCGCCA CCATCGAGGT GCGCGACAAC GCCAAATGGA GCGACGGTAA GCCGGTGAGC
GGCGAGGACG TCGCCTACAC CTTCCAGCTC ATCAAAGACC ACGAGGCGCT CAACCTGCAG
GCCATCCCCT ACGGCGACAT CGCCGCCTCG GGCAACACCG TGACCGTCAG CTTCGACAGC
TCGATGTTCG TGTACAAGGA CAAGTGGCTG GGGCAGACGC CGATCGTGCC CAAGCACATC
TGGGAGACGG TCGAAAACCC GGCCACGCAC ACCAACAAGG CTCCGGTCGG CAGCGGCCCG
TACACGCTCA AGTCGTTCAC GCCGCAGACC ACCACGCTCA CGCTGCGCAG CGACGGCGGC
TACTGGCAGG ATCTGCCCGC GGTGGAGGAG CTGCGCTACA CCTCGTATCT CGACAACAAC
GCCCAGACCA CGGCGCTGTC CGACGGCTCG TCCGAATGGT CGTTCGTCTT CATTCCCAAC
TACAAGACCG TGTTCGTCGA CAAAGACCCG GAGCACTACC ACGTGTGGGC GCCGCCGGTG
CTCGGCATCC ACGGCCTGTA CATCAACACC ACCAAGCCGC CCTTCGACGA CGTCGCGCTG
CGCAAGGCCA TGAACCTGGT CATCAACCGC GAGGACATCT TCTACCAGGC CGAGGCCGGC
TACTTCCACC CGCTGGTGAC CAACGTGAGC GGTCTGCCCT CGCCCGCCGG TGACGCGTTC
ATCGCCGAGG CGTACGAAAA CCAGGAGCAC AGCGTCGATG TCGAGGGCGC CAAGAAGCTG
CTCGCCGAGG CCGGCTACAA GCTCGAGGGC GAGACCCTCA AGGACCCCAA CGGCAAGCCG
GTGACGCTGA CGCTGACCGA TCCGGCCGGC TGGTCCGATT ACCAGACCTC GCTCGAGATC
GTCAAAGACA ACCTGTCCGA AATCGGCATC GCGGCCACGG TGGAGAAGGC CAACCACGAC
GCCTGGTTCC GCAACGTCGA GGAAGGCAAC TTCGACGCCA CCTTCCGCTG GACCAACAGC
GGATCCACGC CTTACGACAT CTACCACACG GTCATGGACG GCGCGCTGCT CAAGCCCGTG
GGCACGGCGT CGCCGGGCGG CAACTTCGGC CGCTTCGACA GCCCGGAGGC GACCGCGGCG
CTGCGCGAGT ACGCCGATGG CAACGACGAC GCCGCCCGCA GCGCGGCGCT CGCCACCTTG
CAGCGGGTGT TCGTCGAGCA GGTGCCCATG ATCCCGGTCG GCGCCGACAA CATCGGCATG
GCCTACAGCA CCAAGAACTG GGTCGGCTGG CCCGACGAGA CCAACCCGTA CACGGCCGGG
CAGCCGACGC AGCCCAACGC CCTGGACGTG GTCCTGCACC TCGAGCCCGC CGGCTCCTGA
 
Protein sequence
MQRRTLFSDT RAKSLAASVL AASVLATGGL SACKDSPNAP KASDKSAANG EPAAAAAVLA 
VGMPNGPQTE NHNPFLTTSA AASLGYRWMM YEPLMMWNPV KPAEPSKPWL AASVKWSPDF
KSATIEVRDN AKWSDGKPVS GEDVAYTFQL IKDHEALNLQ AIPYGDIAAS GNTVTVSFDS
SMFVYKDKWL GQTPIVPKHI WETVENPATH TNKAPVGSGP YTLKSFTPQT TTLTLRSDGG
YWQDLPAVEE LRYTSYLDNN AQTTALSDGS SEWSFVFIPN YKTVFVDKDP EHYHVWAPPV
LGIHGLYINT TKPPFDDVAL RKAMNLVINR EDIFYQAEAG YFHPLVTNVS GLPSPAGDAF
IAEAYENQEH SVDVEGAKKL LAEAGYKLEG ETLKDPNGKP VTLTLTDPAG WSDYQTSLEI
VKDNLSEIGI AATVEKANHD AWFRNVEEGN FDATFRWTNS GSTPYDIYHT VMDGALLKPV
GTASPGGNFG RFDSPEATAA LREYADGNDD AARSAALATL QRVFVEQVPM IPVGADNIGM
AYSTKNWVGW PDETNPYTAG QPTQPNALDV VLHLEPAGS
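
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The sketch below shows one way to strip the block formatting and compute a GC percentage in Python. It assumes the GC content field is the fraction of G and C bases in the gene sequence; the record does not state how that figure was derived or rounded. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two blocks of the gene sequence, used only as a short demonstration.
example = "ATGCAGAGAA GGACCTTGTT"
print(gc_percent(clean_sequence(example)))  # prints: 45.0
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field.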