Gene Hoch_2792 details


Gene Information

Locus tag: Hoch_2792
Symbol:
ID: 8545180
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 3832106
End bp: 3833377
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 55%
IMG OID: 646387484
Product: restriction modification system DNA specificity domain protein
Protein accession: YP_003267212
Protein GI: 262196003
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
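
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3832106
end_bp = 3833377

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends with a stop codon.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1  # the stop codon encodes no residue

print(gene_length_bp, codon_count, protein_length_aa)
```

Under these assumptions the result agrees with the listed gene length of 1272 bp and protein length of 423 aa.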


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.313089
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGATG AGCTGCAAGT GCCAACGCGA TGGAGACGTG TTCGCTTGCT GGATCATGTA 
GATCTTCCCA GCGGTCAGGT CGATCCACGT GACCCTCAAT ACCGCTCGCA ACCCTTGGTT
GCCCCAAACC ACATAGAATC GCAGACTGGA CGGCTCCTTG CACTCGAAAG TGCTGAGTCG
CAAAATGCTA TCAGCGGTAA ATACACGTTT TCTGCTGGGG ACGTTGTTTA CAGTAAAATT
CGTCCCTATT TGCGGAAAGC GATCCTCGCG TCGTTTGACG GGCTGTGCAG CGCAGATATG
TATCCGTTGC GCGCGAAAAC TTCCGTCGAG CCCGGGTTTC TTCTTGCTCT TCTCCTAGGC
GAAGAATTCT CCTCATTCGC AGAGTCTGTA TCGATGCGGA CCGGTATTCC GAAGCTCAAC
CGGAAAGAGT TGGGTTCGTA TCACGCTCGG CTGCCGCCGT TGGGGGAGCA GCGGAAGATT
GCGGCGATCT TGGGGGCGGT GGATGAGGCC ATCGCCAGGA CCCAGGCGGT CATCGAGCAG
GTGCAGGTGG TCAAGAAGGG CCTCATGCAA GATCTGCTCA CCCGCGGCCT CCCCGGCCGC
CACACCCGCT TCAAGCAAAC CGAAATCGGC CAAATCCCCG AATCGTGGTC AGCCGTCCGG
TTGGGCGACG TACTCGATGG CATCGATGCC GGCTGGAGTC CTAAGTGCGC CAATCATCCT
GCAGGCAATG GTGAATGGGG TGTGCTCAAA GTGAGTAGCG TATCGTCGGG AATATACAAG
CCCGAAGAAA ACAAAATGTT ACCCGATGAT CTCATCCCCA AGCCCGAGTT GGAGGTGCGC
CCGGGTGATG TCATCATCGC ACGAGCTAGT GGAGTGCTAG ACTTGGTCGG CGTCTGCTCA
TTTGTATATA AAACGCGTCC TCGTCTAATG CTTTCCGACA AAACGCTACG GGTGCGGCCG
AACCGCACCC TGTTGGATAG CTTCTATCTT GCGCTGACTC TCCAAAGTCC CGTTGTGCGG
AGCTTAGTTT TAGAGAAAGC AACTGGTAGT CATATGCGCA ATATATCCCA AAAGGCCATC
GGTTCAGTCA CTGTGGCGCT TCCGTCATTG GATGAACAGG TCAAAGTATC GAGTGGTATT
ATGGCTATGG ACGCTCGGAT CGATAACGAT ACTCGAAGTG TAGAGTCTTT GACTGAACTG
AAATCCGCCC TCATGTCCGT CCTGCTCACC GGCGAAGTCC GGGTCACGCC GGACGAGGAG
AGTGCATCAT GA
 
Protein sequence
MADELQVPTR WRRVRLLDHV DLPSGQVDPR DPQYRSQPLV APNHIESQTG RLLALESAES 
QNAISGKYTF SAGDVVYSKI RPYLRKAILA SFDGLCSADM YPLRAKTSVE PGFLLALLLG
EEFSSFAESV SMRTGIPKLN RKELGSYHAR LPPLGEQRKI AAILGAVDEA IARTQAVIEQ
VQVVKKGLMQ DLLTRGLPGR HTRFKQTEIG QIPESWSAVR LGDVLDGIDA GWSPKCANHP
AGNGEWGVLK VSSVSSGIYK PEENKMLPDD LIPKPELEVR PGDVIIARAS GVLDLVGVCS
FVYKTRPRLM LSDKTLRVRP NRTLLDSFYL ALTLQSPVVR SLVLEKATGS HMRNISQKAI
GSVTVALPSL DEQVKVSSGI MAMDARIDND TRSVESLTEL KSALMSVLLT GEVRVTPDEE
SAS
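
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code for elongation codons; the function name `translate` and the helper constants are hypothetical and not part of the record. Only the first row of the gene sequence is translated here, as a spot check against the start of the protein sequence.

```python
from itertools import product

# Codon assignments in TCAG order (shared by the standard code and table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence, copied from the record.
first_row = "ATGGCGGATG AGCTGCAAGT GCCAACGCGA TGGAGACGTG TTCGCTTGCT GGATCATGTA"
prefix = translate(first_row)
print(prefix)  # MADELQVPTRWRRVRLLDHV
```

The printed residues match the first two 10-residue groups of the protein sequence. Passing the full gene sequence to the same function would extend the check to the whole record.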