Gene Hoch_1894 details


Gene Information

Locus tag: Hoch_1894
Symbol:
ID: 8544276
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 2603169
End bp: 2604362
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 76%
IMG OID: 646386599
Product: molybdenum cofactor synthesis domain protein
Protein accession: YP_003266334
Protein GI: 262195125
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0303] Molybdopterin biosynthesis enzyme
TIGRFAM ID: [TIGR00177] molybdenum cofactor synthesis domain
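
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2603169, 2604362)
print(length)                     # 1194
print(protein_length_aa(length))  # 397
```

Both results agree with the Gene Length (1194 bp) and Protein Length (397 aa) fields of this record.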


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCACGG TCGAACAAGC CAGCGCCCGG GTGCGCGCCA GCGCCGCCCC GCTGGGCAGC 
GAGCTGGTGC CGCTCGCCCA TGCCCAGGGG CGCATCCTGG GCGCCGATCT GCGCGCCGGT
CGCGCGCTGC CGCCGCACGA CAACTCGGCC ATGGACGGCT TTGCCGTGCG CTGCGCCGAT
CTCCCGGGCA CCCTGCCCGT CGCCGGCACC GTGGCCGCCG GTGACGCAGG CGACGCGGTC
CTGGCCGCGG GCAGCGCGCT GCGCATCATG ACCGGCGCGC CCATGCCGGC CGGGGCCGAC
GCCGTGGTCA TCCGCGAAGA GGTCGAGGAC CTCGGCGAGC GCGCGCGCTT CGCGGCCGCC
GCGCAGCCCG GCGACAACCT GCGCCGCGCC GGCGAGGACA TCGCGCTGGG CGCGGTCGCC
CTGGCCGCCG GCATGCGCCT GGGCGCCGGC GAGCTCGGCC TGGCCGCCGC GCTCGGTCAC
AGTGCCCTGG CCGTGGCCCG GCGCCCGCGC GTGGCCATCC TGTCCACGGG CGACGAGCTG
GTGAGCGCCG AGGTGCCGCC GCGGCCGGGC CAGATCGTCA ACTCCAACGC CTACGCGCTG
GCCGCCCAGG TCCGCGAGGC CGGCGGCATC CCGGTCGACG CCGGCATCGC GCCCGACGAC
CCCGATATCC TGGTCGCCCG CGTGCGCAGC GCGCTGGCCG CCGACGTGCT GCTCACCGCG
GGCGGCGTCT CGGTCGGTGA CTTCGACTTC GTCAAGGACG CCTTCGCCCG CGCCGGCGTG
ACCATGGACT TCTGGAAGGT CGCGGTCAAG CCCGGCAAAC CGCTCGCATT CGGACACACG
TCCGACAAGC GCCCGGTGTT CGGCCTGCCC GGCAATCCCG TGTCATCGAT GCTCGGCTTC
GAGCTGTTCG TGCGCCCGCT GCTCCTGGCC ATGCAGGGCG CGCGCTCGCT CGATCGCCCG
CGCGCGACGG TCACGCTCGC CAGCGACTAC GGCAAGCGGC CGGGCCGCGA CCACTATCTG
CGCGCGCGCC TGCGCCGCGA GGGCGATGTC CTGCGCGCCG AGTTGCACCC CCGCCAGGGC
TCGGCCATGC TCGGCTCCAT GGTCGATATC GACGCCCTGG TCATCGCCCC CGCCGACAGC
GGCGACCTGC CCGCAGGCAC CCGCCTCGAG GCGCTGCTGC TGCGTGCGGT CTGA
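
The gene sequence is displayed in space-separated blocks of 10 bases, so it needs to be joined before any computation such as the GC content field. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C bases over the total sequence length. The helper names are hypothetical, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases (assumed definition of the GC content field)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence above, as displayed in the record.
first_row = "ATGCTCACGG TCGAACAAGC CAGCGCCCGG GTGCGCGCCA GCGCCGCCCC GCTGGGCAGC"
print(len(clean_sequence(first_row)))  # number of bases in the row
print(round(gc_percent(first_row), 1))
```

Passing the full 1194 bp sequence to gc_percent in the same way would give a value comparable to the GC content field of the record.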
 
Protein sequence
MLTVEQASAR VRASAAPLGS ELVPLAHAQG RILGADLRAG RALPPHDNSA MDGFAVRCAD 
LPGTLPVAGT VAAGDAGDAV LAAGSALRIM TGAPMPAGAD AVVIREEVED LGERARFAAA
AQPGDNLRRA GEDIALGAVA LAAGMRLGAG ELGLAAALGH SALAVARRPR VAILSTGDEL
VSAEVPPRPG QIVNSNAYAL AAQVREAGGI PVDAGIAPDD PDILVARVRS ALAADVLLTA
GGVSVGDFDF VKDAFARAGV TMDFWKVAVK PGKPLAFGHT SDKRPVFGLP GNPVSSMLGF
ELFVRPLLLA MQGARSLDRP RATVTLASDY GKRPGRDHYL RARLRREGDV LRAELHPRQG
SAMLGSMVDI DALVIAPADS GDLPAGTRLE ALLLRAV