Gene Hoch_3143 details


Gene Information

Locus tag: Hoch_3143
Symbol:
ID: 8545531
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 4321781
End bp: 4322920
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 67%
IMG OID: 646387810
Product: ABC transporter related protein
Protein accession: YP_003267538
Protein GI: 262196329
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
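
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption that is consistent with the reported figures) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(4321781, 4322920)
print(length, protein_length_aa(length))  # prints "1140 379"
```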


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.27093
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.0973127
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCTCCA TCAAGATGCG CAATGTCCAC AAGCGCTTTG GCAACACCCA CGTGATCAAG 
GGCGTGGACC TCGACATCGA GGACGGCGAC TTCTGCGTCT TCGTCGGTCC CTCGGGCTGC
GGCAAGTCCA CCATGCTGCG GCTCATCGCC GGTCTCGAGG ACATCTCGTC CGGCGACCTG
TTCATCGGCG AGAAGAAGGT CAACACCGTG GCGCCCTCGC GCCGCGGCGT GGCCATCGTG
TTCCAGTCCT ACGCCCTGTA TCCGCACATG AACGTGTACG ACAACATGGC CTTCGGGCTC
AAACTGTCGC GCCAGGGCAA GGACGAGATC AAACAGCGCG TCGAAGAGGC GGCCAAGATC
CTGCAGATCG ACCACCTGCT GCACCGCCTG CCCAAGGAGC TGTCGGGCGG CCAGCGCCAG
CGCGTGGCCA TCGGCCGCGC CATCACCCGC CAGCCGCAGG TGTTCCTGTT CGACGAGCCG
CTGTCCAACC TCGACGCCGC GCTGCGCGTG CAGACCCGGC TCGAGCTGGC CAAGCTGCAC
GAGCGCCTGG GCACGACCAT GGTCTACGTG ACCCACGACC AGGTCGAGGC CATGACCCTG
GCCGACAAGA TCGTCATCCT CAATGCCGGA CACGTGGCCC AGGTCGGCGC GCCGCTCGAG
CTGTATCACT TCCCCGACAA CCTCTTCGTG GCCGGCTTCA TCGGTTCGCC GAAGATGAAC
TTCATCCCCT GCATCGTCGA CGAAGCCGAC GCCGACGGCG CCGCCATCAC GCTCTCGGAC
GGCACCCGCA TGCGCGTGCC CGTGGACGCG GCGCGGGCCA AAAAGGGCGA CTCCGCCACC
CTCGGCATCC GCCCCGAGCA CCTCGAGCTG CTCGGCGCCG AGGCCGAGGC CGACAACGCC
CTGGCCGGCG AGGTGCAGAT CGTCGAGCAT CTCGGCGAGG GCTCGTTCAT CTACGTCAAG
ACCTCGGTCC ACGAGCAGAA CCTCACGATC AAGGAAGAGG GCGACACCGC GGCCAGCAGC
GGCTCCAGCC TGCGCATGCG GCTGCCGCCC GAGAGCTGTC ACCTCTTCGA CCGCGACGAG
CAGGCCTTCC CGCGTCTGCA CAAGGCCAGC AAGCTGGCCG AGCTCAAGAT CGAGCGCTGA
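
The GC content figure in the gene information can be checked against the gene sequence above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over all bases in the gene; the page does not state how its value was computed or rounded. The function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# The first two 10-base blocks of the gene sequence: 11 of 20 bases are G or C.
print(gc_percent("ATGGCCTCCA TCAAGATGCG"))  # prints "55.0"
```

Passing the full 1140 bp sequence (with or without the block spacing) gives the value for the whole gene.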
 
Protein sequence
MASIKMRNVH KRFGNTHVIK GVDLDIEDGD FCVFVGPSGC GKSTMLRLIA GLEDISSGDL 
FIGEKKVNTV APSRRGVAIV FQSYALYPHM NVYDNMAFGL KLSRQGKDEI KQRVEEAAKI
LQIDHLLHRL PKELSGGQRQ RVAIGRAITR QPQVFLFDEP LSNLDAALRV QTRLELAKLH
ERLGTTMVYV THDQVEAMTL ADKIVILNAG HVAQVGAPLE LYHFPDNLFV AGFIGSPKMN
FIPCIVDEAD ADGAAITLSD GTRMRVPVDA ARAKKGDSAT LGIRPEHLEL LGAEAEADNA
LAGEVQIVEH LGEGSFIYVK TSVHEQNLTI KEEGDTAASS GSSLRMRLPP ESCHLFDRDE
QAFPRLHKAS KLAELKIER
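
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do that, not the database's own pipeline. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which codons may act as initiators; since this gene starts with ATG, that difference does not matter here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored. Codons containing anything other than
    A, C, G or T raise KeyError. Alternative start codons (e.g. GTG)
    are NOT converted to M in this sketch.
    """
    dna = "".join(cds.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


first_line = "ATGGCCTCCA TCAAGATGCG CAATGTCCAC AAGCGCTTTG GCAACACCCA CGTGATCAAG"
print(translate(first_line))  # prints "MASIKMRNVHKRFGNTHVIK"
```

The printed residues match the first 20 residues of the protein sequence listed above.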