Gene Hoch_5179 details

Gene Information

Locus tag: Hoch_5179
Symbol:
ID: 8547591
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 7129886
End bp: 7131562
Gene Length: 1677 bp
Protein Length: 558 aa
Translation table: 11
GC content: 75%
IMG OID: 646389856
Product: TonB family protein
Protein accession: YP_003269560
Protein GI: 262198351
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0226] ABC-type phosphate transport system, periplasmic component
TIGRFAM ID: [TIGR01352] TonB family C-terminal domain
            [TIGR02136] phosphate binding protein
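
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 7129886
END_BP = 7131562
GENE_LENGTH_BP = 1677
PROTEIN_LENGTH_AA = 558


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# 7131562 - 7129886 + 1 = 1677 bp; 1677 / 3 = 559 codons; 559 - 1 = 558 aa.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the reported gene length and protein length agree with the reported coordinates.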


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCCG TCGCCTTTGC CGACGTGCTC CAGATGCTGG CCTCGAATCA GACCACCTGC 
CGCATTCTCC TGGGCGCAGA CGGCGTACTC GGCGAACTCT ACCTCGAGGA CGGCGCCCTG
GTTCACGCGC AGCACGGCGA GCTCGAGGGC AACGAGGCCG CGTTCGCCTT GATCGCGGTG
AGCGGCGGCA CCCCCTTCGA CGTCCAGGAC GATCAGCCCG CGCCCCGGCA CACGGTCGAG
GACGACATCG GCTATCTCAT CCTCGAGGCC GCCCGGCGCC GCGACGAGGG CTTGCTGCCG
CCGCCGGACG CGATCGTCCG CCTGCCCCCC GAGGCCGGCC CGGAGGTGGC GGCGCAGGTG
CAGCCTCGGC GCGCGCTGCG GGCGGTGCTG GCGGCCTCGG TGCTGGCGGT CCTGATAATG
GGCAGCTATC TGGCGCTGGC GCCGGTGGCC GCGCCCACGC CGACGGCGAC CGCCGCGGAC
GCCGCCGCGG ACACCGCCGC GGCTGCGCCC GACCCCGCCG GCAGCGAGGC CGCCGCCGAA
GAGCCTGTGC TCGACCTCGA CCTCGACCGC GGCCCGGTGT TTCTCGACGG CCCGCCCGCG
CTGCCGCCGG TCGAGGCCGC GCCGCTGTCG CCGACCGTGC TGTGCCGCAT CCGCGTCGAC
GAAACCGGCC GCGTGTCCGA AGCCGTCGTG TATCGCTCGC GCCCGGCCCT GGCCGCGTAC
GAAAACGCCG CCCTCGCCGC CGTTCGCGCC TATCGCTTCG CCCCCGCCAT GCGCCGCGGC
AAGCCGGCCG CCGCGTGGCT CAACTGGCCC GTGCACGTGG CCGCGCAGCG CAGCGAGATG
CTGGCCATCC GCGGCAGCGA GACCATCGGC GAGGCCCTGC TGCCGGCCCT GGCCGACGCC
TATCGGCAGC GCCACCCCGA GGCCTCGGTG GCGCTCGCGG CCTCGGGCCC CGGCGGCGGC
GTGGACGCGC TGCTCGCCGG CACCGTCGAC ATCGCCGCGG CCTCGCGCCC GATCTCGGCC
GACGAGCTGG CGCGCGCGGC CGCCCGCGAG CTGAGCATCG AGGAGTTCGT CATCGGCTAC
GACGGCGTCG CCATCATCGT CCACCCCGAC AACCCCGTGC GCGCGCTCGA TCTATCCCAG
CTCCGCGCGC TGTTCTCCGG ACAGGTGGCG AGCTGGAGCG CCCTGGGCGG CGCCGACCGC
GCGGTGCAGC TCTTCGGCCG CCCCCAGGGC GCGGGCACTC GCGCGCTGTT CGAGGCCATG
GTGTTTCAGC GCCCCGACAC CGGCGCCGAG GCCAACGACG GCGACGGCGC AGCGGCGAGC
TTCGCTCCGG GCGTGCGCGA GCCCGCCAGC AACCGCGAAC TCATCGCCGC CGTGGCCGCC
GATCCCGGCG CGGTGGCGTA CCTCAGCACG AGCTGGCTAC GGCCTGAGGT GGTCGCCGTC
GCCCTGTCCG AAGAACCGGG CGGCGACGCG GTCGAGCCCA CGCCGGAGTC CATCCGCACC
GGCCGCTACC CCTTGCACCA CCCGCTGCAC ATGTACGCGC GCGGCCCGCT CGACAACGCG
GTCGCGCGCT TCCTGCTGTT CGCGCTATCC CCGGGCGGAC AAGCGATCGT TCGCGCGCAC
GGCTTCGCCG ACGGCGCCCT GCCCCTCAAC TTCGCGCTCG AACAGCTCGC GCGCTGA
 
Protein sequence
MRAVAFADVL QMLASNQTTC RILLGADGVL GELYLEDGAL VHAQHGELEG NEAAFALIAV 
SGGTPFDVQD DQPAPRHTVE DDIGYLILEA ARRRDEGLLP PPDAIVRLPP EAGPEVAAQV
QPRRALRAVL AASVLAVLIM GSYLALAPVA APTPTATAAD AAADTAAAAP DPAGSEAAAE
EPVLDLDLDR GPVFLDGPPA LPPVEAAPLS PTVLCRIRVD ETGRVSEAVV YRSRPALAAY
ENAALAAVRA YRFAPAMRRG KPAAAWLNWP VHVAAQRSEM LAIRGSETIG EALLPALADA
YRQRHPEASV ALAASGPGGG VDALLAGTVD IAAASRPISA DELARAAARE LSIEEFVIGY
DGVAIIVHPD NPVRALDLSQ LRALFSGQVA SWSALGGADR AVQLFGRPQG AGTRALFEAM
VFQRPDTGAE ANDGDGAAAS FAPGVREPAS NRELIAAVAA DPGAVAYLST SWLRPEVVAV
ALSEEPGGDA VEPTPESIRT GRYPLHHPLH MYARGPLDNA VARFLLFALS PGGQAIVRAH
GFADGALPLN FALEQLAR
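
The sequences above are printed in space-separated blocks of ten residues, so they need to be cleaned before any length or composition check. The sketch below is illustrative only: the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample string is just the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used here as a small sample.
first_blocks = "ATGCGCGCCG TCGCCTTTGC"
sample = clean_sequence(first_blocks)
print(len(sample), gc_fraction(sample))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `len` and `gc_fraction` gives values that can be compared against the Gene Length and GC content fields.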