Gene Hoch_2030 details


Gene Information

Locus tag: Hoch_2030
Symbol:
ID: 8544412
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2800489
End bp: 2802048
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 68%
IMG OID: 646386733
Product: deoxyribodipyrimidine photolyase-related protein
Protein accession: YP_003266468
Protein GI: 262195259
COG category: [R] General function prediction only
COG ID: [COG3046] Uncharacterized protein related to deoxyribodipyrimidine photolyase
TIGRFAM ID:
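
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 2800489
end_bp = 2802048
gene_length_bp = 1560
protein_length_aa = 519


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 2802048 - 2800489 + 1 gives 1560 bp, and 1560 / 3 - 1 gives 519 aa, which agrees with the listed gene and protein lengths.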


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0662893
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.30492
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTACTT CTCCGCGCGA TCTCTTCGTC ATCCTCGGCA ACCAGCTCGT CCCCTTTCGC 
CATCTGCGCC CGCATCGCGA TGCGGCGTTC TTCATGGCCG AGGACCTGGG TTTATGTACG
TACGTCCGAC ATCACAAGCA GAAGATCGCC CTGTTCTTGG CCGCGATGCG CGCGCACGCG
GACGAACTGC GGCGCAACGG CTGCGCGCTG CACTACGAGT CGCTGGACGA GCAGGCCGGA
GCGGAGCTAC GCACCAAGTA CGAGACCAAG CTGGCCCGCT ACGCCGATCG CGCCGGACCG
TTCGACCGGC TGCTGAGCTT CGAGGTCGAG GACCTGTTCT TCGAGCGCCG TCTCGACGCG
GTGGCCGACG AGCTCGGGCT CGAGCGGGTG ACGCTGGCGA GTCCGATGTT TCTGTGCTCG
CGCGAGCGCT TCGCCGGGTA CGCGCGCGGG GCCACGCGCT TGCGCATGGC CGATTTCTAC
GAGCGCCAGC GCCGCCATCT GGGCATTTTG ATCGACAGCG AGGGCGCGCC GGTGGGCGGG
CGCTGGAGTT TCGACCGCGA CAATCGCGAA AAACTGCCGC GGGATGAGTC CTTGCCGGCG
GCGCCCGCGG CCGCGCCCAC CGATCACGTG CGCGCGCTCA TCGCCCTGGT CGGCGAGCGC
TTTGCCGACC ACCCGGGCGA ACTCAGCGAG GCGGGCTGGT GGCTGCCGAG TACGCGGCGG
CAGGCGCTGG CCTGGCTGCG CGGGTTTTTG GACGAGCGTC TCGAACGCTT CGGCGCCTAC
GAGGACGCGC TGTCCACGCG CGGGCCGGTG CTGTTCCACA GCGTGCTCAG CCCGCTGCTC
AACCTCGGCC TGATCACGCC GGACGAGGTG GTCGAGCGCA CGCTGGCGCA CGCCGAGGAC
CACCGGGTGC CGCTCAACTC GCTCGAGGGC TTTCTGCGCC AGATCATCGG CTGGCGCGAG
TTCGTACGCG GCGTCTACCG CGGCCATTCC GAGCAGCAGG AAACGGCCAA CGCCTGGGGT
CACCACCGCC GCATGAAGCC GTGCTGGTGG GACGCGAGCA CCGGGCTGCG CCCGCTCGAC
GACGCCATCG CCAAGGTGCT GCGGATGGGC TGGGCGCACC ACATCGAGCG GCTCATGGTG
CTGTGCAATC TGATGAACCT GTGCGAGATC GAGCCGCGAC AGGTGCACGA TTGGTTTCTG
GCCATGTTCG TCGACGCCGC CGACTGGGTG ATGGGGCCCA ACGTCTACGG CATGGGCCTG
ATGAGCGACG GCGGCCTGTT CGCGACCAAA CCGTATATCT GCGCCAGCAA CTATCTGCTG
AAGATGAGCG ACTACGGCCG GCCGGCGGCG GGCGAGGTGT TCCCCTTCGG CGACAGCGAC
TGGTGCACGG TCGTCGACGG CCTGTACTGG CGCTTCGTGC GCGAGCACCG CGACTTCTTC
GCCGGGCACC CGCGCATGGC GGTGATGGCG GCTTCCCTCG ACCGCATGTC CGGGGACAAG
AAGCAGCGTA TCTTTGCGGC CGCGTCGGCG TTTCTCGACC GGGTGACCGA GACGCCCTGA
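
A GC content figure such as the one in the Gene Information section can be recomputed from a sequence laid out in space-separated blocks like the one above. This is a minimal sketch; the helper name `gc_content` is illustrative, and the sample input is only the first two blocks of the gene sequence, not the full gene.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two 10-base blocks of the gene sequence (12 of 20 bases are G or C).
first_blocks = "ATGGCTACTT CTCCGCGCGA"
print(f"{gc_content(first_blocks):.0%}")  # 60%
```

Passing the full gene sequence to the same function gives the gene-wide value, which can then be compared with the percentage reported in the record.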
 
Protein sequence
MATSPRDLFV ILGNQLVPFR HLRPHRDAAF FMAEDLGLCT YVRHHKQKIA LFLAAMRAHA 
DELRRNGCAL HYESLDEQAG AELRTKYETK LARYADRAGP FDRLLSFEVE DLFFERRLDA
VADELGLERV TLASPMFLCS RERFAGYARG ATRLRMADFY ERQRRHLGIL IDSEGAPVGG
RWSFDRDNRE KLPRDESLPA APAAAPTDHV RALIALVGER FADHPGELSE AGWWLPSTRR
QALAWLRGFL DERLERFGAY EDALSTRGPV LFHSVLSPLL NLGLITPDEV VERTLAHAED
HRVPLNSLEG FLRQIIGWRE FVRGVYRGHS EQQETANAWG HHRRMKPCWW DASTGLRPLD
DAIAKVLRMG WAHHIERLMV LCNLMNLCEI EPRQVHDWFL AMFVDAADWV MGPNVYGMGL
MSDGGLFATK PYICASNYLL KMSDYGRPAA GEVFPFGDSD WCTVVDGLYW RFVREHRDFF
AGHPRMAVMA ASLDRMSGDK KQRIFAAASA FLDRVTETP
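
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The function name `translate` is illustrative.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence give the first block of the protein.
print(translate("ATGGCTACTT CTCCGCGCGA TCTCTTCGTC"))  # MATSPRDLFV
```

The last four codons of the gene, GAG ACG CCC TGA, translate to ETP followed by a stop, matching the end of the protein sequence.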