Gene Hoch_2286 details


Gene Information

Locus tag: Hoch_2286
Symbol:
ID: 8544672
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 3184518
End bp: 3186575
Gene Length: 2058 bp
Protein Length: 685 aa
Translation table: 11
GC content: 69%
IMG OID: 646386991
Product: type I phosphodiesterase/nucleotide pyrophosphatase
Protein accession: YP_003266722
Protein GI: 262195513
COG category: [S] Function unknown
COG ID: [COG3379] Uncharacterized conserved protein
TIGRFAM ID:
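
The coordinates and lengths listed above are consistent with each other, and a short sketch can confirm this. It assumes 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The helper names `gene_length` and `expected_protein_length` are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length(3184518, 3186575)
print(length)                           # 2058
print(expected_protein_length(length))  # 685
```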


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTATT CACTCCACCG CGCCGCCCCT CGGGACCAGA CCCGTGGCGG CAAGGCGCTC 
CGCGCGCGGC TGCTGAGCGC GGCCCTATTT GCCACAGCGG CCGCGAGCGC GAGCCTCATC
CTGGGCGCCG GCTGCGGCGG CGACTCGGAC ACCTCCTCGT GGCAGGCGAC CGACAAGCGC
GTCGTGGTCC TCGGCGTCGA CGGCATGGAC CCCAAGCTGC TCGACCAGTA CATGCGCGAG
GGCCGCATGC CCAACCTCAA GGCCCTGGCC GAGCGCGGCA GCTACATGCC GCTGGCGACC
ACCTTCCCGC CGCAGAGCCC GGTGGCGTGG TCGACCTTCA TCACCGGCAT GGAGGCCCAC
GGTCACGGCA TCTACGATTT CGTCCACCGC GACCCGCACA CCCTGGCGCC GTATCTGTCG
ACCTCGAGCA CGCACTCGGC CGAGACCTTC ACCCTGGGCT CGTGGCGGCT GCCCAATCCC
TTCTCCTCGG CCAGCGTCGA GCTCTTGCGC GGCGGCCGTG CCTTCTGGCA GATGCTCGAG
GACCAGAAGG TCCCGGCCAC CGTGGTCAAG GTGCCGGCCA ACTTCCCGCC CGCGGACTCG
CGGTACAACC CGTCCATGGC CGGCATGGGC ACGCCCGACA TCCTCGGCAC CTACGGCACC
TTTCAGCTCT TCACCGACGA CCCGGCCTTT GTCGGACGCA AGGTCTCGGG CGGCATCATC
CACCCGCTCG ACTTCGCGGG CGGCCAGCGC GCCAGCGCGC CGCTCGACGG CCCGCCCGAG
CCGCTCGACG CTGAGAACAA AGCCATGAGC GTCGAGGTCG AGATCCTGCG CGACCCCGAG
CGCGATGTCG CCGTGGTGCG CGTCGGCGAC AGCGAGCAGG TGCTCGCCGT CGGCGACTGG
AGCGACTGGA TCCCGCTCGG CTTCGACCTG TCGATGTCGA CCACCGACCT GCCGGGCATG
GTGCGCGTGT ACCTGGCCGA GCTGCGCCCG CACGTGCGGC TCTACGCCAG CCCGACCAAC
ATCGATCCCA CCGCGCCGGC CATGCCGATC TCGTCGCCGG CGAGCTTTGC CGAGGACGTG
GCCGGCGACA TCGGCCGCTT CTACACTCAG GGCATGCCCG AGGACACCAA GGCCCTGGCC
GCGGGCGCGC TCTCGGACGA CGAGTTCTTG GCCCAGGCCG AGCTGGTGTG GGAGGAGCGT
ATGCGCCTGC TCGAGCGCGA GCTGTCGAGC TTCATCGCCG GCAAGGGCGG GGTGCTGTTC
TTCTACTTCT CGTCCATCGA CCAGGTGTCC CACGTCTTCT TTCGCACCCT CGATCCCAGC
TTGCGGCCCG AGCACGTGGA GGAGGACAGC AAGCACGCCG ATCTCATCCC CTCGCTGTAC
CAGCGCATGG ACAAGGTCAT CGGCGACGTC ATCGAGCGCA TCGGCCCCGA CACCGATATC
GTCGTGATGT CCGATCACGG CTTCTCGCCG TACCGCTACA AAGTGCACCT CAACGACTGG
CTGGCGCAGC AGGGCTACCT GGCGCTCCTG CCCTCGGACG ACCCCGACGC GCCGGCCAAG
ATCGACTGGG ACAGCACCCA GGCCTACGCC GTCGGCCTCA ACCAGGTGTT CATCAACCTG
CAGGGCCGCG AGGCGCACGG CGTGGTCCCG GCGTCCGAAT ACGACGTCCT GGTCGAGCGC
CTGGCGCGCC AGCTCGAGCG CCTGCGCGAC CCCAACACCG GCGCCTACGT GGTCACCGAG
GCCGTGCGTC CGGGTCCGAG CGAGTTCCCC GAGCGCTCGC CCGACCTGCT CGTCGGCTAC
GGCCGCGGCT ATCGCAGCTC CGACGAGTCC GCCGAGCTGC GCGTGGTCGG CGGCGACGCC
GAGATCATCG AGCCCAACCG CGACAAGTGG AGCGGCGATC ACTGCATGCA TCCCTCGCAC
GTGCCCGGCG TGTTGCTCAC CAATCGCAAG ATCGAGGCCG AGAGCGCTTC GCTTCTCGAT
CTCGCGCCGA CTATTCTCGC GTACTTCGGC ATTGCCAAAA GCGACGCCAT GAGCGGCAAG
ACGCTCTGGC AACCCTAG
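
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any computation. The sketch below shows one way to strip the spacing and estimate GC content. The function names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input; running it on the full sequence would be needed to compare against the GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence above, used only as sample input.
sample = "ATGAGCTATT CACTCCACCG CGCCGCCCCT CGGGACCAGA CCCGTGGCGG CAAGGCGCTC"
seq = clean_sequence(sample)
print(len(seq))
print(round(gc_fraction(seq), 2))
```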
 
Protein sequence
MSYSLHRAAP RDQTRGGKAL RARLLSAALF ATAAASASLI LGAGCGGDSD TSSWQATDKR 
VVVLGVDGMD PKLLDQYMRE GRMPNLKALA ERGSYMPLAT TFPPQSPVAW STFITGMEAH
GHGIYDFVHR DPHTLAPYLS TSSTHSAETF TLGSWRLPNP FSSASVELLR GGRAFWQMLE
DQKVPATVVK VPANFPPADS RYNPSMAGMG TPDILGTYGT FQLFTDDPAF VGRKVSGGII
HPLDFAGGQR ASAPLDGPPE PLDAENKAMS VEVEILRDPE RDVAVVRVGD SEQVLAVGDW
SDWIPLGFDL SMSTTDLPGM VRVYLAELRP HVRLYASPTN IDPTAPAMPI SSPASFAEDV
AGDIGRFYTQ GMPEDTKALA AGALSDDEFL AQAELVWEER MRLLERELSS FIAGKGGVLF
FYFSSIDQVS HVFFRTLDPS LRPEHVEEDS KHADLIPSLY QRMDKVIGDV IERIGPDTDI
VVMSDHGFSP YRYKVHLNDW LAQQGYLALL PSDDPDAPAK IDWDSTQAYA VGLNQVFINL
QGREAHGVVP ASEYDVLVER LARQLERLRD PNTGAYVVTE AVRPGPSEFP ERSPDLLVGY
GRGYRSSDES AELRVVGGDA EIIEPNRDKW SGDHCMHPSH VPGVLLTNRK IEAESASLLD
LAPTILAYFG IAKSDAMSGK TLWQP
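
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal illustration, not the pipeline that generated this record. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which codons may act as start codons). Alternative start codons are not handled, and the `translate` helper is a hypothetical name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGAGCTATT CACTCCACCG CGCCGCCCCT"))  # MSYSLHRAAP
print(translate("ACGCTCTGGC AACCCTAG"))               # TLWQP
```

The two outputs match the first ten and last five residues of the protein sequence listed above.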