Gene Hoch_5667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5667 
Symbol 
ID8548081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7777983 
End bp7780223 
Gene Length2241 bp 
Protein Length746 aa 
Translation table11 
GC content74% 
IMG OID646390335 
Producthypothetical protein 
Protein accessionYP_003270037 
Protein GI262198828 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.627177 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0881924 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGACA CCGAGGCTCG AATTCCCAGG TCGCTGATCG AGGCGGTGCA GGAACACCGC 
CTGGTGCCCT TTATCGGCGC CGGCGCCAGC ATGCACGTGG CCCGCTCGCT GTTTCCCTCG
TGGTATCGGC TGCTGCAGGA GATGGTGGCC GAGATGGACG CCGAGGGCCT GCCGCTGGCC
GAACTCGAGC GGGTCAAGAC CCTGATCCAA GCCGACGAGT ACGTGCCCGC GGCCGAGCTC
GGCTTTCGCG CGCTCGGCGC CCAGCGCTTC CACAAGTTCC TGCGCCAGCG CTTTCGCCAC
CTGCCGCCGG GCGACGCCCA GCTCGACCTG GTCGAGGCGC TGTGGGCGCT GCGGCCGGAG
CTGATGATCA CGACCAACTA CGACAGCGTG CTGCGCTGGG CCGGGCCCAA GGACGTGCAG
ATCGTGGCCA ACGATCAGAG CGAGGAGCTG GGCTTCCTCA GCGCCGACGC CTCGCCCGAG
TGGCCGTGGA TCTGGCACCT GCACGGTACG ATCGAGCGCA TGGCCACGGT CATCCTCGCC
GGCAGCCAGT ACGAGGAGCT GTACGGCGAC GAGGACGGCC GCGGCCACGC GTATCAGCGC
GCGCTCTTTG AGCTGCAAAA GACGCTGTCG GCGCGGCCGC TGCTGTTCGT CGGCTTTGGG
CTCTCGGATC CCTACGTGCT GCAGCAGATC CGCCACGTCC TCGACATCAC CCGCGGCAAC
CAGCAGCCGA GCTACGCGCT GCTGAGGCGC GGCGAGGCCG ACTACGGCGA CCTGTGGGAG
AATCATCATA TTCAGCTCCT CGAGTACGCC GACCACGGCG CGCCCCTGGT CGAGCTTGTG
CGCGCGATCG CGGCCCGGGC CTGGCCTGAG GATGAGGACA AGGCGGCCGA GGAGACGCCG
AGAGAGCGCG GGCTCGACTT GAGCGAGATG GACCTGGGCG CGCCGCTGTC GTACGCCGAG
CGCGCGCCGG CCGAGCGCTC GGCGGAGCGC GTCGTGCGGC GCGATGCCGC TGATCTCTTG
CCCGGCGGCG GCGCGATGGG CGGCGATTCG TCGGGCGAAC CGGGACGTCC GCGCAGCAGC
CCGGCGGTGG ATAGCAACCT GATGCCGCAG GCGCAGCCGC GTCCGAAGGC GCCGCCCGCG
CCCGAGCCCG CGCCCGAGCC GTCTTCGCCG CATCGCGGAC CGGCGCCAGC GGCGGCGGCG
CAGCTGGCCC CGGCCGGGGA CGACGCGGAC GAGTTCACGG GCGAGCTGAA CGACGCAGCC
GAGGCCGCGC CGCCGCCGCC GGTGCTCGAG GGCGCGGCGT CGGGGGTCGG TCTGGCGCCG
CGCGTGGCCC TGGTGGCGTC GCTGGCCAGC GAGCTGGTCA CCGGCCAGCG CTTGCTCCTG
CTCGGGCCGC GCGGCGGCGG CGTGCACACC CTGGCCGAGC AGATCGCGGC GGCTCGGTTC
CGCGCCCGCA CCACCTGGCT CAACCCGCCC AGCGCGCCCG AGTGCACCGA GGCCGAGTAC
TGCGCCTTTG TGAGCGGCGA CGCCCGCGCC GACTGCTTCG CCGCGCTGCA CCGCGTGCTC
GAGGAGCGCG CGCGCGCGGC CGGCGGCGAG CTGCTGGTGG TGCTGCGCCA CGACGGCGGG
CCGCTGCATC ACCTCGAGCG GCTCGGCGAC CTGCTGCGCT CGCTGCTCGA CGAGGGCCGG
CAGCGCGGGC TGGCCATCTT CGCCCTGGTC GCCGGCGGCG CGCCCGCGGC CGAGCTGCGC
TACCGGGCGC TCGAGACCTC GCTGTTCTCG GGCGCGCCCG TGCGCCACGT GCCCATGCTC
AACGGAGGCG AGGTCGCCGA GGTGCTGGCC AAGGCGGGCC GCGATCGCGG CCTTGCTTCG
GACGTGTGGA CGGCAACCGG CGGGCTGCCG CGGCTGGTGC GTCAGGCCCT GGCCGGCGGC
GGTTCATTGC GCGAGGCCGA CATCAGCGCG CGTCTGCGCG ACAGCTCGGC CATCCGCGGT
CGCCTGCACC AGCGCCTGCT GGCCGACGAT CGCGAGCAGG TGCCCGCGGC CCGGCACGCG
CGCAGCGCGC TCGCGACCCT GCTCGGCGGC GGCGCGCTCG AGCCCTTGCG CAAGGTCGAG
GACGAGCTGC GCTACGCCGA GGTGCGCCTG TACTACGATG GTCTCGTCCG CGCGGACGAA
AACGGCGCCA CGCGCGTTCT GTGCCCGGCC GTGGCCGCGG CCGCCGAGCG GCTGCTGGCG
CGCGAGGGCG GGCGCGGGTG A
 
Protein sequence
MSDTEARIPR SLIEAVQEHR LVPFIGAGAS MHVARSLFPS WYRLLQEMVA EMDAEGLPLA 
ELERVKTLIQ ADEYVPAAEL GFRALGAQRF HKFLRQRFRH LPPGDAQLDL VEALWALRPE
LMITTNYDSV LRWAGPKDVQ IVANDQSEEL GFLSADASPE WPWIWHLHGT IERMATVILA
GSQYEELYGD EDGRGHAYQR ALFELQKTLS ARPLLFVGFG LSDPYVLQQI RHVLDITRGN
QQPSYALLRR GEADYGDLWE NHHIQLLEYA DHGAPLVELV RAIAARAWPE DEDKAAEETP
RERGLDLSEM DLGAPLSYAE RAPAERSAER VVRRDAADLL PGGGAMGGDS SGEPGRPRSS
PAVDSNLMPQ AQPRPKAPPA PEPAPEPSSP HRGPAPAAAA QLAPAGDDAD EFTGELNDAA
EAAPPPPVLE GAASGVGLAP RVALVASLAS ELVTGQRLLL LGPRGGGVHT LAEQIAAARF
RARTTWLNPP SAPECTEAEY CAFVSGDARA DCFAALHRVL EERARAAGGE LLVVLRHDGG
PLHHLERLGD LLRSLLDEGR QRGLAIFALV AGGAPAAELR YRALETSLFS GAPVRHVPML
NGGEVAEVLA KAGRDRGLAS DVWTATGGLP RLVRQALAGG GSLREADISA RLRDSSAIRG
RLHQRLLADD REQVPAARHA RSALATLLGG GALEPLRKVE DELRYAEVRL YYDGLVRADE
NGATRVLCPA VAAAAERLLA REGGRG