Gene Hoch_1591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1591 
Symbol 
ID8543973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2175607 
End bp2176746 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content66% 
IMG OID646386299 
Producthypothetical protein 
Protein accessionYP_003266034 
Protein GI262194825 
COG category[S] Function unknown 
COG ID[COG4627] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.761062 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATCA TCTCGAAAGC AGCGAAATGG GCCAAGCGCA CCAAGCGCAA CATCAACGGC 
GGTGCCAAGC AGGGCGCGCA GGGGAAACGC GAGGGCGGCG AAGCCGGGGC CGGCAAGCAC
ACCTCGGTCG AAGCCATCGC CACGGCCGCG GTCGGAGCCG CGACCAAAGC GGCGGCGGCG
GCGGAGGCGG CGGCGGCTGC TGCTGCGCGC GCTGCCGAGC TGGCCGAGGC GGCCTCGGCC
AAGGCCGGCG CTGAGGGCGC GGGCAAAGGC GCGGGCAAAG GCGCGGCCAA AGGCGAGGGC
GGCAGCGACG ACATCCCGGC TCGCTACCTG CGCCTCGCCC GCCGCGACTG GCAGCAGGAC
CTCGAAAAGA ACGGCTTCTT CGAAAACGCC TATCGCGCCG GCTACTCGAA GCAGACGCTC
AAAGAGCGGC GCTTCATCAA CTTCGGCCCC GGCAGCTTCT TCCACAAGTA CTGGCAGAAC
GCCGACCGCT TGTACGCCGG CCGCACCTGG AGCGAGCAGC GCGGCAAGGG CTACAAGATG
AAGATCGACA TCGACTGGAA CTTGCTGGCC TGCGAGCCGG TCGCCGTCGA GGACGACAGT
ATCGCGGTCT GCTACTGCAG CCACCTCGTC GAGCACGCCT GGGATGACGC CGTGAAGTTC
TTCTTCCGCG ACGTGTTTCG CATCCTGCAG CCCGGCGGCG TCTTCCGCGT CACCTGCCCC
GATGCCGACC TCGGCATCCG CGCCTGGCAG AACGACGACC GCTACTTCTT CCTGCGCTAC
GCCGGTCGTC CGGTGGCCTT TGGCCTGCTC AACGACACCT CGTTGGTCAC CCACCCGGAG
AACTCGTTCC ACCTGCGCGC GCAGGAGGCC GAGGGCTTCA TGCAGAAGTT CGACGACCCG
TACGAGGCGC TGTCCGAGGC GTCGCGCCTA TCGGATCGCG ACCTGCAAAA CAAGGTCGCC
GCGCACGTCA ACTGGTTCAA TCGCAGCAAG TTGCGCGCGT TCCTCGGGGA GGCCGGATTC
ACCGAGATCC ACGACTCTTC GTACTCTCAG AGCTCGGTGC CCGTGCTTCG CGACGTGCGC
TATTTCGACA AGACCGACCC CCATATGACT TGTTACATAG AAGCGCGAAA GACGATTTAG
 
Protein sequence
MSIISKAAKW AKRTKRNING GAKQGAQGKR EGGEAGAGKH TSVEAIATAA VGAATKAAAA 
AEAAAAAAAR AAELAEAASA KAGAEGAGKG AGKGAAKGEG GSDDIPARYL RLARRDWQQD
LEKNGFFENA YRAGYSKQTL KERRFINFGP GSFFHKYWQN ADRLYAGRTW SEQRGKGYKM
KIDIDWNLLA CEPVAVEDDS IAVCYCSHLV EHAWDDAVKF FFRDVFRILQ PGGVFRVTCP
DADLGIRAWQ NDDRYFFLRY AGRPVAFGLL NDTSLVTHPE NSFHLRAQEA EGFMQKFDDP
YEALSEASRL SDRDLQNKVA AHVNWFNRSK LRAFLGEAGF TEIHDSSYSQ SSVPVLRDVR
YFDKTDPHMT CYIEARKTI