Gene Hoch_5541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5541 
Symbol 
ID8547955 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7601603 
End bp7602676 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content58% 
IMG OID646390215 
ProductAppr-1-p processing domain protein 
Protein accessionYP_003269917 
Protein GI262198708 
COG category[R] General function prediction only 
COG ID[COG2110] Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0616045 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGGTGGG CGTTCACACT CTCCAACGCC GGTGCCTACT ATTTCGCCGA AGATTGTGAA 
GCGATCGTCA ACACGGTCAA CTGTGTCGGC GTCATGGGTC GAGGCATCGC ATTGCAGTTC
AAGAAGGCCT ACCCTGAAAA TTTCAAGGTC TACGCCGCTG CCTGCAAGCG AAAGGAAGTC
CAGCCGGGCC AGATGCTGGT GTTCAGGACG GGGCGATTGA TGAACCCACG GTACATCATC
AACTTTCCAA CCAAGCGACA CTGGCGCGGT AAGAGTAGAA TAGAGGATAT CGAGTCGGGT
CTTGTAGCAT TGGCCGATGT GCTCGGCGCT TGTAGAATAA GGTCGATCGC TATCCCGCCG
CTTGGGGCAG GCTTGGGCGG CCTTGACTGG ATGCAGGTTC GTGAGCGGAT CGAAGCAGCT
TTAGGCGGCT TGGAAGATGT CCAGATCGTG GTCTTCGAGC CGCGAGCGGC AACTGCGAGC
GAACGACCGA ACCGTTCTCG CGAGGTGCCC GGGATGACGC CAGGACGTGC GGCGCTGCTC
ATGCTGATAG ATCGGTATCT CGCCGGACTA CTCGATCCCT CCGTGACCCT ATTGGAACTC
CACAAGTTAA TGTACTTTCT TCAAGAAGCG GGAGAACCGC TCAAGCTTAA GTACCAAAAA
GCCCACTATG GGCCCTATGC CGAGAACCTT CGGCACGTGC TTCATGCGAT CGAGGGGCAC
ATGGTGTCGG GCTACGCGGA TGGTGGCGAC GCTCCCGACA AACAACTCGA ACTCGTTCCT
AAGGCTCTTC GCGATGCTGA GACCTTCTTG AAGAGCAAGG AGACGACGCG ATCGCACATG
CAGAGGGTCT TCGAACTCGT GGACGGTTTT GAGTCGCCGT TCGGGCTGGA GTTGCTGACG
ACCGTGCACT GGGTGGCAAC CAGGGAGCGG CCGCAGTCCG CGGACGAGGT CGTCTCGGCG
ATCCACGGCT GGAACGCTCG CAAGATGCAG TTCTCCAGAC GCCAGATTCT GCTCGCGCTC
GACGTTCTCT CGCGCAAAGG CTGGTACACA CCGGCGTGGG AGGCGAACGC ATGA
 
Protein sequence
MRWAFTLSNA GAYYFAEDCE AIVNTVNCVG VMGRGIALQF KKAYPENFKV YAAACKRKEV 
QPGQMLVFRT GRLMNPRYII NFPTKRHWRG KSRIEDIESG LVALADVLGA CRIRSIAIPP
LGAGLGGLDW MQVRERIEAA LGGLEDVQIV VFEPRAATAS ERPNRSREVP GMTPGRAALL
MLIDRYLAGL LDPSVTLLEL HKLMYFLQEA GEPLKLKYQK AHYGPYAENL RHVLHAIEGH
MVSGYADGGD APDKQLELVP KALRDAETFL KSKETTRSHM QRVFELVDGF ESPFGLELLT
TVHWVATRER PQSADEVVSA IHGWNARKMQ FSRRQILLAL DVLSRKGWYT PAWEANA