Gene Hoch_5157 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5157 
Symbol 
ID8547568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7104303 
End bp7105541 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content68% 
IMG OID646389833 
Producthypothetical protein 
Protein accessionYP_003269538 
Protein GI262198329 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.376133 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.739341 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCGCC AGACCCTCAC CATCGCCCCG GGCAACCCCG ACTTCCTCGA CCTGCCCTGG 
GACCAGCCGC TCGCGAGCTG GGACTCGCCC CGGCTGATCG AGCTGCCCAA GGGCGTTTCT
CGACACCCCG TCCGATTTCT CGCCTACGAG CAGGGCATCT ACGTGGTCAA GGAGCTGGCC
AGGAACGCGG CCCGCTGCGA CTACGAAGTG CTGCGCGCGC TCGAGGCCAA GGAGCGGCTG
TTGGCGGTGC CGCCCGTGGG CCTGGTCGAG CACCGTCACC CCGACGCCAG CGCCGAGCAC
TCGGCCGCGC TCATCACGCG CTATGTCGAG TATTCGTTCT CGTATCGCGA GCTGTTGCAG
GGCGCGGGCT TTGGCGAGCG GCGCACGCAG ATGCTCGACG CCTTTGCCGG ACTGCTGGTC
GAGTTACACG TCGCCGGCTG CTACTGGGGC GACTGCTCGC TGTCCAACGT GCTGTACCGC
TTCGACGCCG AGGCCGTGGA CACCGTCATG GTCGACGCCG AGACCGCGTT CATGCGCGAC
GCGCTCAGCG ACGGTCAGCG CGAGCAGGAT CTCGACATCA TGATCGAGAA CGTGGCCGGC
GGCATGGCCG ACATCGCGGC CTCGCAGGGG CTGGTCCTCG ACGACGCCGA CCTGTGGCTC
GGCGAGGACA TCGCCGCGCG CTATCGCAAG CTGTGGGACG AGCTGAGCCG CGTGGAGCTC
ATCGGCCCGG GCGAGCGCTA CAAGATCACC GAGCGCATCC GGCGCATCAA CGAGCTCGGC
TTCGTGGTCG AGGAGATCGA CCTCGTGCCC GCGGACGGCG GCGAGCGGCT GGCCCTGCGC
GCGCACATCG GCGGTCGCAA TTACCACTGC AACCAGCTCA AGGCGCTCAC CGGGCTCGAC
ACCTCGGAGT GGCAAGCGCG CCAGATCCTC TCGGACCTGC GCCACTTCCA GGTCCGCAGC
GCGGCGACCA CGAGAAACAG CAACAGCAAC AGCAACAGCA ACAGCAACAG CAACAGCAAC
AGGGACGTGG CCGCGATCCT GTGGCGCGTG CGTCGCTTCG AGCCCGCGCT GGCGCGCATC
CGGGCCACCC CGGGCGTGAG CGACCCGGTG CAGGGCTACT GCGACCTATT GCACTACCGC
TACCTCAAGT CCAAGGACGC CGGCTACGAC CTCGAGACCG AAGCCGCGCT CGAACAATGG
TTTGCGGACG GCCGTCCCGG ATACCTGATC CGCCCCTGA
 
Protein sequence
MPRQTLTIAP GNPDFLDLPW DQPLASWDSP RLIELPKGVS RHPVRFLAYE QGIYVVKELA 
RNAARCDYEV LRALEAKERL LAVPPVGLVE HRHPDASAEH SAALITRYVE YSFSYRELLQ
GAGFGERRTQ MLDAFAGLLV ELHVAGCYWG DCSLSNVLYR FDAEAVDTVM VDAETAFMRD
ALSDGQREQD LDIMIENVAG GMADIAASQG LVLDDADLWL GEDIAARYRK LWDELSRVEL
IGPGERYKIT ERIRRINELG FVVEEIDLVP ADGGERLALR AHIGGRNYHC NQLKALTGLD
TSEWQARQIL SDLRHFQVRS AATTRNSNSN SNSNSNSNSN RDVAAILWRV RRFEPALARI
RATPGVSDPV QGYCDLLHYR YLKSKDAGYD LETEAALEQW FADGRPGYLI RP