Gene Hoch_1818 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1818 
Symbol 
ID8544200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2505918 
End bp2507696 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content70% 
IMG OID646386524 
ProductPEGA domain protein 
Protein accessionYP_003266259 
Protein GI262195050 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.679089 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCGGCGC CGACGCTGCG ATGTTGGGCG ACGCTGCTCG CCGCCTGCCT GTGTGTGGCC 
GCGCTGGGCG GGAACGCGGC CCGCGCTGAA GCGCCCGCGC CGGCGCCCGC GCCGATCGTG
GGCGAGGCCG ACGAGCCGCA GGAGCTGCTC GAGATCGAGT CGTGCATGCA GCCGCTCGAC
CTCGACGCGC GCGAGGCCGA GAACCGCGCT GCCGACCACT ACGACCGCGG CATTCAGCTC
TACGAACAGG GCGACTACGA GGCCGCGATC GAGGAGTTCG TCAGCGCCTA CTGTCTCAAG
CCCTACTACC GGGTGCTCAA GGACATCGCG CAGTCGTTCG AGCGCATGGT CAATTACGAA
AAGGCCGTGA TTTATCTCGA GCGCTATCTG GCCGGGATGC CGGCCGACGA GGTCGAAGAG
CGCCGGGTCC AGCGCGGCCG CGTCGAGGTG CTGCGGCGGC TGCCGGCGCG CATTCGCGTG
GCCACCTCGC CGGCCGGGGC CACGGTCACC CTGCGCGACG CCGGGCGCGT GGAGGCGCGC
GCGCTGGCCG ACGACGACAC GCCCATCGCC ATTCGCAAGG GCCGTTACCT GATGTCCGTG
GAGCACGATG GCTACGAGAC CATCACGCGG CCCATCGCGG TTGAAATCGG ACAGCCGTAC
AGTTTTTATT TTCAGCTCCA GCCCAAGACC GGGTCGCTGC GCATCAACGC CACGCCGGGT
GACGCGCGCA TCTTCGTCGA CAAGCGCTGG GTGGGCGTGG GCAGCTACGT GGGCGAGCTG
CCGCTCGGCG TCTACCAGGT CGAGGTCGAA CTCGATGGCT ATGTGCGCCA GACGCAGGAG
ATCACCATCG ACGATATCGG CGCCCACCGG ATGGCGATCA CGCTCGAGCC GCAGCCGCGT
TCGGGTCGCG GCGAGATGAT CGCGGCGGCT TCGGCCGGCG GTTCGCTGGC GGTCGGCTTG
GTCGCCGGTT CGCTGTTTCG CGGCCAGCGG GGCCTGGCCG CGGGGACCGG GTTGGCGCTC
GGCGGCGGGC TGGGCTTCGC CGGCACCTAC CTCAGTGTGC CCGGCGATCT CCCGGTCGGT
CACAGCTCGT TCATCATCGG CAGCAGTTTG ATCTCGGCGT TCGAGGCCGG CCTGAGCGCG
AGCCTGGTGG AGCGCCTGTT CCTCGACGAG TCGTGCAGCG ACGACGATCG TATGGAGGGG
GCTGCGGAGG ATGGGTTTGT GGGGCGAGTA AACGTCGATA GCTGCCGTTC GCGGGTGATC
CTCGGCGCCA GCATCGTGGG CGGCATCAGC GGCGCGCTGT TCGCCCTGGC CTCGGCCGAC
TATCTGCTGC TCGACGAAGG CGATGTGGCC GTGGTCCACT CGGGCGCGCT GTGGGGCACG
ATCGCGGGCG GGCTGCTGTG GTTCGCGTTC GACCGCGCGG GCGATCGCGG CGACGTCCTC
ACCTTGGCCA CGCTCAACAT GGGCCTGGCC ACCGGCGGCT TCCTGGCCGC GCGCGGCACC
TACAGCCGCA GCCACGTGAG CCTGGTCGAT CTCGGCGGGT TGGGCGGCCT GCTGGCCGGC
GCGACCAGCT CGCTGGTGTT TTACCGGGAT AGCTTCGAGG AGCGCGCGCC GCACTTCGCG
CTCGTGGGGA CGGTCGTTGG CCTGGCGCTG AGCAGCTATC TCACGCGCAA CATGGATGAT
CCCGACATCC GCGCGCCGCA GATCAAGCCG CTCTTCGGCC GCACCGAAGA CGCCGCCGGC
CGCTCCGTCG CCACCGTCGG CGCCTCGCTG CGCTGGTAG
 
Protein sequence
MPAPTLRCWA TLLAACLCVA ALGGNAARAE APAPAPAPIV GEADEPQELL EIESCMQPLD 
LDAREAENRA ADHYDRGIQL YEQGDYEAAI EEFVSAYCLK PYYRVLKDIA QSFERMVNYE
KAVIYLERYL AGMPADEVEE RRVQRGRVEV LRRLPARIRV ATSPAGATVT LRDAGRVEAR
ALADDDTPIA IRKGRYLMSV EHDGYETITR PIAVEIGQPY SFYFQLQPKT GSLRINATPG
DARIFVDKRW VGVGSYVGEL PLGVYQVEVE LDGYVRQTQE ITIDDIGAHR MAITLEPQPR
SGRGEMIAAA SAGGSLAVGL VAGSLFRGQR GLAAGTGLAL GGGLGFAGTY LSVPGDLPVG
HSSFIIGSSL ISAFEAGLSA SLVERLFLDE SCSDDDRMEG AAEDGFVGRV NVDSCRSRVI
LGASIVGGIS GALFALASAD YLLLDEGDVA VVHSGALWGT IAGGLLWFAF DRAGDRGDVL
TLATLNMGLA TGGFLAARGT YSRSHVSLVD LGGLGGLLAG ATSSLVFYRD SFEERAPHFA
LVGTVVGLAL SSYLTRNMDD PDIRAPQIKP LFGRTEDAAG RSVATVGASL RW