Gene Hoch_3551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3551 
Symbol 
ID8545941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp4888617 
End bp4889672 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content70% 
IMG OID646388220 
ProductYbbR family protein 
Protein accessionYP_003267946 
Protein GI262196737 
COG category[S] Function unknown 
COG ID[COG4856] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00859415 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0365971 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTGGC TCGACGAACG CCGCGCCAAG CGTCAGCGCC GCAGCGGGGT GTGGCCGCGG 
CTCAGCCCGC CGAGCAAAGA GGAGCGCAAG GCCTCCTGGG ACGACGTCCG CCAGGGCCTG
CGCCAGATGT TCACGCGCAA TCCGCTGCTC AAGCTGGTCT CGTTGGTGCT GGCGCTGTCG
CTGTTCTTCC TGGTCAACAC CGACCGCGAC GCGATCATCG GCGTCAACGT CGATGTCTCC
TACCAACTGC CCGAGAACCG CGTGCTGGTG TCGCAGCCGG TAGACCAGGT GCGGCTGTCG
ATCCGCGGGC CCTGGCGGCG TATCAAGCGC TTCGACGAGC GCGAGATCGA CCGCATCCTG
GTCGATCTGA CCAACGTCCA GGATGGTCCG TTCACCTTTC CCGAGGACGA GGTGGTCCTG
CCCGAGGACC TGACCCTGCT GTCGATCAAC CCGCCGACCA TCAACGTGGC CTTCGAGCCC
CGGGTGCAGA AGACCGTGCC GGTCGAGGTC GCCACCCAGG GCGAGCCCGC GCGCGGCTAC
GAGGTCCAGC GCATCCTGCC AAAACCCTCG CAGGTGACGA TCCGCGGCGC CGAGACGCGG
GTGCGCGAGA CCAACCGTGT GCACACGCGC GAGCTGCGCC TCGACGGCCG CACCGATTCG
TTTACCGAGG TGCTGCCGCT GGAGCCGCCG CGCACCGAGC CGCGCTCGCT GATCGAAATC
GCCGACCGCG TGCCCATCGA GGTCGAGGTG ATTCTGGCGC CCGAGATGGG CACGCGCACC
ATCGAAGACG TGCCCGTGCG CATCGTGGCG GGCGAGGGCG TGAGCGAGGC GGTCGAGGAG
CGCTTCGCGA CCGATCCGGC CACCGTGGAT ATCGTGCTGC ACGGGCCGCT GCTGGAGATC
GAGAGCTTCA GCGGCGAGGT CACGGCGGTG GTGAGCGTGC ACGCCGAGGA CGGCACCGCG
CGGCCGCGCA GCGCCGACAT CCAGGTGCGC AACGTGCCCG CCGGCGTCGG CACCGAGGTC
AAGCCGCCCG CCGTGACCCT GCAGGGCGCG CGCTGA
 
Protein sequence
MSWLDERRAK RQRRSGVWPR LSPPSKEERK ASWDDVRQGL RQMFTRNPLL KLVSLVLALS 
LFFLVNTDRD AIIGVNVDVS YQLPENRVLV SQPVDQVRLS IRGPWRRIKR FDEREIDRIL
VDLTNVQDGP FTFPEDEVVL PEDLTLLSIN PPTINVAFEP RVQKTVPVEV ATQGEPARGY
EVQRILPKPS QVTIRGAETR VRETNRVHTR ELRLDGRTDS FTEVLPLEPP RTEPRSLIEI
ADRVPIEVEV ILAPEMGTRT IEDVPVRIVA GEGVSEAVEE RFATDPATVD IVLHGPLLEI
ESFSGEVTAV VSVHAEDGTA RPRSADIQVR NVPAGVGTEV KPPAVTLQGA R