Gene Hoch_2053 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2053 
Symbol 
ID8544435 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2839440 
End bp2840780 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content73% 
IMG OID646386756 
Producthypothetical protein 
Protein accessionYP_003266491 
Protein GI262195282 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.567279 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.709257 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCGCA TCCAACATCG TCCCCCGCTC GCCCCGCCGG GATACCGCGC CGCGGCGCCC 
CGCGGCCGGG GTCGCGGCCC CCACACGCGT GGGCTGGCCC TCGCCTCGCT GGCCCTCGCC
TCGCTGGCGC TCGGCGGCCT GCTGCTCGCC TGCGGCGGCG AGGGCGACAG CGCCCCCGAC
GCCGGCAGCG CGGACGCGAG CATCTCGGAC GCCCGTCCCA GCGACGCCGG CGACGCGGAC
GGCGGCGGCA ACGCGCTGTG CACGAGCGGC GCGCTCACCA GCACCACCTG GCGCGACTAC
GGCTCCCTGG CCACGCATGC CGCGTCCATG CTGGTGTTCG ACGATCAGCT CTGGGTGGGC
ACGGACGACG GCCTGTGGTC ACACCCGCTC ACCGACGACG ACGGCGGCGA CGGCGACCTG
TGGCAGCAGC GCGCGCTGGC GGGCCGTCGC GTGAGCGCCC TGCGCGTGCT CGACGCCGAG
GCCGGCACCC TGCTGGCCGG ACTCGCGTCC GCCGAGGCCG CAGCGCAGAC CGAGCCCGCC
TTCGCGCTGT CGAGCGACCG CGGCCAGAGC TTTGCCCTCT ACGGCGCCGA GCTCGGCTAC
GACGACGCCG GCACGCGCCG CTACGACGCG GTCAACGACC TGGCCGTGCA CCGCAGCGGC
GCCATCTACG CGGCCATGTC CGGGGTGTCG ATCGCGCGCT CGAGCGACGG CGGCCAGAGC
TGGAGCTACG TGTTCGGCCA GCCCGCCCAG ATCTGCTATC CGTGCCGCCT GCACATCGCC
GCAGGCGCGC CCGACGCCCT GTACCAGGGC TGCGAGTGCC CGCTCGACAT GGCCTCGATC
GACCGCTTCG CGCTGCCCGA GAGCGCCGAC GGCGGGTTCC CCGACCAGGG CGAGCGCCTG
CTCGACTACC GGGACATCGG CAACCGCCGC ATCAACTCCT TCGCCAGCAC CGACGCGTAT
CCCGGGCGGG TCTACGCCGG CGTGGAAGGC GCGCTGCTGT GGCTCGAGGG CGCGGACGAG
TGGGACTACC TGTATCGCTC GATGGGCGCC GAGAAGCTGT ACACCTACGT CGAGGCCATC
TGGATCGACC CCTGCGACCC GGCGCATATC GTCTTCGGCG GCGGCGAGCA GAGCGAAAAC
CAGATGCTGA GCCTGTTCGA GAGCTACGAC GAGGGCGTGA GCTGGGAGAT GCTGATGCCG
CCGGGGCTGA GCTTCGATCA GGCCGTGGTC GAGCGCGGCC TCAGCGCGGG CGCGAGCGGC
GAGCACGCGA TCCTGGCCGT GTGGACGAAT TCCGACGGCG CCAAGAGCGT GCGCATCCTC
GCCAAGCGGC ATTCGCCCTG A
 
Protein sequence
MDRIQHRPPL APPGYRAAAP RGRGRGPHTR GLALASLALA SLALGGLLLA CGGEGDSAPD 
AGSADASISD ARPSDAGDAD GGGNALCTSG ALTSTTWRDY GSLATHAASM LVFDDQLWVG
TDDGLWSHPL TDDDGGDGDL WQQRALAGRR VSALRVLDAE AGTLLAGLAS AEAAAQTEPA
FALSSDRGQS FALYGAELGY DDAGTRRYDA VNDLAVHRSG AIYAAMSGVS IARSSDGGQS
WSYVFGQPAQ ICYPCRLHIA AGAPDALYQG CECPLDMASI DRFALPESAD GGFPDQGERL
LDYRDIGNRR INSFASTDAY PGRVYAGVEG ALLWLEGADE WDYLYRSMGA EKLYTYVEAI
WIDPCDPAHI VFGGGEQSEN QMLSLFESYD EGVSWEMLMP PGLSFDQAVV ERGLSAGASG
EHAILAVWTN SDGAKSVRIL AKRHSP