Gene Hoch_5154 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5154 
Symbol 
ID8547565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7098705 
End bp7100573 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content65% 
IMG OID646389830 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_003269535 
Protein GI262198326 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1175] ABC-type sugar transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCGCA ACGCCCTCCT CAAGGTTGGA ATCTCTGCCG TGGTGCTGGC AGCGGCGCTG 
TACGCGGCCT TTAGTCTGTA TATGTCGGGC GAGCCGGCGC TCGCTGCCGT CTGTCTGGGC
GCCTTCGCGC TCGGCGTCTA CGTGTACACG GCGGCGCGCG CGCGCACCTA TCGCTATCTC
TTCCCCGGTT TGGCAGGACT CGCGATCTTC GTCTTGTTGC CGCTGGTCTA CACCATCGCC
CTGGGCTTCA CGCGCTACCG CACCGAGAAC TTCCTCACCT TCGAGCGCGC CACCGCCGCG
CTGCTGGGCG AGACCTTCGA GCGTCCGGGC GATCGCTACC GGGTGAGCCT GCTGAGCGCC
GAGGACGGCG CGTACCGGCT GCTGCTCGAG ACCGTGGCCC AGCCCGCCGA GGCTGTCGAG
CCGGGCGAGC CCGCCGACGC CGCCGATGAA GGCGACAGCA TCTTCGGCGA CGAGGACGAG
GGCGACAGCA TCTTCGGGGA GGAAGAGGAG GAGGACAGCA TCTTCGCCGA CGACGCGGAC
GCGGAGGCGG GGGACGAGGA CAGCATCTTC GGGGAGGAGG AGGAGGACAG CATCTTCGCC
GACGACGAGG ACAGCATCTT CGGGGAGGAG GAAGAGGACA GCATCTTCGC CGACGATGAG
GATGGCGACG CCGCGGCCGA TGACGATGCC GGCGAGGATT CGCAGGGCGC GCGCGGCGCG
TTCTTCGTCT CCGAGCCCGT GGCTCTCGAC GAGCTGGTGG GCGAGGACGT CGGCGAGGAC
CCGGCGGTGT TGCGCGCCGT GCCCCTGAGC GAGCTGCCCG CCGTCACGCC CGAGGATGCG
CTCTCGCGCA TCGAGCTCAT CAAGCGCCGC GACGCCTTGC AGCGGGTATC GGTGCGCTTC
CCGGACGATA GCTGGGCGGT CAAAGACAAG CTCAGCATCG ACCTGTTCTT GCCGCAGACC
GCGCTCTACA CCCAGAACCC CGATCAAACG CTGGTCAACA ATCGCACCGG CGAGGTGCTG
ACGCCGAACT TCGACACCGG CTTCTACGAG AACGCGGCTG GCGAGTTCGT CCGTCCCGGT
TTTCGCGTGC TTGTCGGCTT CGACAACTAC ATGCGCCTGC TCACCGACCG GCGCATTCAG
GAACCCTTCC TGCGCATCTT CCTGTGGACG GTGCTGTTCG CCGGGTTCAC GGTGCTGTTC
ACGCTCATCG TCGGGCTGGT GCTGGCGGAG CTGATGAGCT GGGAGGCGCT GCCGATGCGG
GGCCTGTACC AGATCCTCTT GTTCCTGCCT TACGCGGTGC CCGGCTTTAT CTCGATCCTG
GTGTTCAAGG GTCTGTTCAA CTCGGCATCG GGCGAGATCA ATCAGATTCT GCTGGATCTC
TTCGGCGTCG CGCCCGATTG GTTCGGCGAC CCGTTCCTGG CCAAGGTGAT GATTCTCATC
GTCAATACCT GGCTCGGGTA CCCGTACATC ATGCTGCTGT GCATGGGTCT CAAGAAGTCG
GTGCCCTCGG ACCTGTACGA GGCCACGGCG CTGGCCGGCG CCAGTCCGCT CACCAACTTC
CTCAAGATCA CCTGGCCGCT CATCCGCAAG CCGCTCACGC CGCTGCTCAT CGCGTCGTTC
GCGTTCAACT TCAACAACTT CGTGCTGGTG TTCCTGCTCA CGGGCGGTCG CCCCGACTTC
CTCAATACCA GCACGCCGGC GGGCGAGACC GATATCCTGG TGAGCTACAC TTATCGCATC
GCGTTCCAGG ATTCCGGCCA GAACTACGGT CTGGCCGGCG CCATCTCGAC CCTGATCTTC
GTCCTGGTCG CCATCCTGTC GATCGTGAAT CTGCGGATGA CGAACGTGAA CAAGGAAGAG
AGGCGCTGA
 
Protein sequence
MMRNALLKVG ISAVVLAAAL YAAFSLYMSG EPALAAVCLG AFALGVYVYT AARARTYRYL 
FPGLAGLAIF VLLPLVYTIA LGFTRYRTEN FLTFERATAA LLGETFERPG DRYRVSLLSA
EDGAYRLLLE TVAQPAEAVE PGEPADAADE GDSIFGDEDE GDSIFGEEEE EDSIFADDAD
AEAGDEDSIF GEEEEDSIFA DDEDSIFGEE EEDSIFADDE DGDAAADDDA GEDSQGARGA
FFVSEPVALD ELVGEDVGED PAVLRAVPLS ELPAVTPEDA LSRIELIKRR DALQRVSVRF
PDDSWAVKDK LSIDLFLPQT ALYTQNPDQT LVNNRTGEVL TPNFDTGFYE NAAGEFVRPG
FRVLVGFDNY MRLLTDRRIQ EPFLRIFLWT VLFAGFTVLF TLIVGLVLAE LMSWEALPMR
GLYQILLFLP YAVPGFISIL VFKGLFNSAS GEINQILLDL FGVAPDWFGD PFLAKVMILI
VNTWLGYPYI MLLCMGLKKS VPSDLYEATA LAGASPLTNF LKITWPLIRK PLTPLLIASF
AFNFNNFVLV FLLTGGRPDF LNTSTPAGET DILVSYTYRI AFQDSGQNYG LAGAISTLIF
VLVAILSIVN LRMTNVNKEE RR