Gene Hoch_3185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3185 
Symbol 
ID8545573 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4388666 
End bp4389826 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content69% 
IMG OID646387852 
Productextracellular solute-binding protein family 3 
Protein accessionYP_003267580 
Protein GI262196371 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0319469 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.327167 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTTCC CCGTCTGCCT GCTCGTGTCC CTGCTGTGCG CGTTCCCGGT CGCCGCCTCG 
GCGCAGACCC AGACCCCGCC AGCGACACCG GCTGAGCCAG CGGCTGAGAC GGGCGACACG
CTCGCGCAGG CGGCCGAGGA TGCGGTCGAA CGCATCGAGC GCGATGAGCT GCGCATCGGT
ATCAGCACGT TCCCGCCCTT CGTGCTCACC GGAGGTAATC CCCACAGCGG CTTCTCGATC
GAGCTGTGGC GGCTGGTCGC CGAGAGCCTG GACGTCGATT ACACCTTCGT CGCCAGCACC
GGCGTGGCCG ACAAACTCGC GCGCCTGCGC GGCGACCAGC TCGACGTCGC CATCGGCGGC
ATCACGGTAA CCACCGAGCG CGAGCGGCTG GTCGATTTCA CCCACCCGGT CACCGACAGC
GGTCTCGGCA TCCTGGTCCG CGAAGGCGAA GGCGGCGGCG CCGGCTTTTT CCAGCGCATC
ACCTTCAACG ACAGCAAATG GGGCCTGGTC ATCGGATTTT TGGCCCTGGT CATCGTCGCC
GGCAACCTCA TCTGGTGGGC CGAGCGGGGC CGCGAATCGT TCAGCGATAA GTACTTCCCC
GGCGTCTTCG AGGGCATGTA CTGGGCCATC GTCACCGCCA GCACCGTGGG CTACGGCGAC
AAGACGCCGA CGAGCTGGCG CGGCCGCGCG ATCGCCGGGC TCACCATCGT CATCACGCTG
CCGCTCTTCG CCCTGTTCAC GGCCGAGCTG GCCTCGACCA TCACGGTCGC CGAGATCCAA
TCGCGCATCG ACGGACCCGA GGATCTGCGC GACAAGCGCG TCGGCGTGGT CCGCGGCACC
GTGGCCGCGG ATTGGGCCGC GGGCTTCGGC CTCGAACTCG TCCAGTGGGA CGGAATCGGC
GAGGTCTACG ACGCGCTCGA TCGCGAGGTC GTGGACGCCG TCATCCACGA CGCGCCCAGC
CTGCAGTACT ACGCGCAGAA CCAGGGCAAG GACGACGTGC AGGTGGTCGG CGGTCTGTTC
CAGGCGCAGT CCATCGCCTT CGCGCTGAAC GAAGGCTCGC CGCTGCGCGA GCCGCTCAAC
CGCGCCCTGC TCTCACTGGT CGAGTCGGGC GAACTCGAGC GGCTGCGCGT GCGCTGGTTC
GGCACCGGTG CGCGCAAGTA A
 
Protein sequence
MRFPVCLLVS LLCAFPVAAS AQTQTPPATP AEPAAETGDT LAQAAEDAVE RIERDELRIG 
ISTFPPFVLT GGNPHSGFSI ELWRLVAESL DVDYTFVAST GVADKLARLR GDQLDVAIGG
ITVTTERERL VDFTHPVTDS GLGILVREGE GGGAGFFQRI TFNDSKWGLV IGFLALVIVA
GNLIWWAERG RESFSDKYFP GVFEGMYWAI VTASTVGYGD KTPTSWRGRA IAGLTIVITL
PLFALFTAEL ASTITVAEIQ SRIDGPEDLR DKRVGVVRGT VAADWAAGFG LELVQWDGIG
EVYDALDREV VDAVIHDAPS LQYYAQNQGK DDVQVVGGLF QAQSIAFALN EGSPLREPLN
RALLSLVESG ELERLRVRWF GTGARK