Gene Hoch_3924 details

Gene Information

Locus tag: Hoch_3924
Symbol:
ID: 8546320
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5409168
End bp: 5410157
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 70%
IMG OID: 646388596
Product: Porphobilinogen synthase
Protein accession: YP_003268316
Protein GI: 262197107
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0113] Delta-aminolevulinic acid dehydratase
TIGRFAM ID:
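
The coordinate and length fields in this record are related by simple arithmetic, which can serve as a consistency check. The following is a minimal Python sketch, assuming the start and end coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5409168
end_bp = 5410157

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 990, matching "Gene Length"
print(protein_length)  # 329, matching "Protein Length"
```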


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0861463
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.218881
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAGCT ATCCCCATAC CCGTATGCGG CGCAATCGCG CGAGCGACTG GAGCCGCCGC 
CTGGTGCGCG AGAACGCGCT GAGCACCGAT GACCTCATCT GGCCGGTGTT CGTGTGCGAG
GGCGACAGCA CGCGCGAGCG CGTGGGCTCG ATGCCCGGCG TCGAGCGCCT GTCCATCGAT
CTCCTCACCG AGGCCGCGGC CGAGGCCAAG GAGCTCGGCA TCCCGGCGCT GGCGCTGTTC
CCGGTGACGC CTGCGGACAA GAAGACGCCC GACGGCGAAG AGGCCAAGAA CCCGGACAAC
CTGGTGTGCC GCGCGGTCCG CGCCGTCAAG CAGCAGGTGC CCGGCATCGG CATCATCTGC
GATGTCGCCC TCGACCCGTA CACGACCACG GGCCAGGACG GGCTGGTGCG CGACGGCTTC
GTGGTCAACG ACGAGACCGT CGAGGTGCTG TGCGAGCAGG CGGTGGTCCA GGCCCAGGCC
GGCTGCGACG TCATCGCGCC TTCGGACATG ATGGACGGCC GCATCGGCGC CATCCGCGAC
GCGCTCGACG CCGAGGATCT CGACCAGGTG CAGATCCTGG CCTACTCGGC CAAGTACGCG
TCGGCGTTCT ACGGTCCCTT CCGCGACGCG GTGGGCTCGA GCGGCGCGCT CGGCACCGGC
GACAAGCGCA CCTACCAGAT GGATCCGGCC AACGGCGACG AGGCCGAGCG CGAGGTCGCC
CAGGATCTGG AAGAGGGCGC CGATATGGTC ATGGTCAAGC CCGGCATGCC GTACCTCGAC
ATCGTGTTCC GGGTGAAACA CACCTTCTCG GTGCCGACCT TCGTGTACCA GGTGAGCGGC
GAGTACGCGA TGCTGCGCGG CGCGGCCGAG CAGGGCTGGC TCGACTGGGA CAAGGTGCTG
CTCGAGAGCC TGCTGGCCTT CAAGCGCGCG GGCGCCGACG CCGTGCTCAC CTACGGCGCG
CTCGACGCGG CCCGCCTGCT GCGGAGGTAG
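
The gene sequence is printed in blocks of ten bases separated by spaces, so it needs cleaning before its length or GC content can be checked against the record. The sketch below is one way to do that in Python. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first printed line of the sequence is used as a short demonstration.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First printed line of the gene sequence, used only for demonstration.
first_line = "ATGTCCAGCT ATCCCCATAC CCGTATGCGG CGCAATCGCG CGAGCGACTG GAGCCGCCGC"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases after cleaning
print(round(gc_fraction(seq), 2))  # GC fraction of this line only
```

Applying the same two helpers to the full sequence gives a length and GC fraction that can be compared with the gene length and GC content fields of the record.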
 
Protein sequence
MSSYPHTRMR RNRASDWSRR LVRENALSTD DLIWPVFVCE GDSTRERVGS MPGVERLSID 
LLTEAAAEAK ELGIPALALF PVTPADKKTP DGEEAKNPDN LVCRAVRAVK QQVPGIGIIC
DVALDPYTTT GQDGLVRDGF VVNDETVEVL CEQAVVQAQA GCDVIAPSDM MDGRIGAIRD
ALDAEDLDQV QILAYSAKYA SAFYGPFRDA VGSSGALGTG DKRTYQMDPA NGDEAEREVA
QDLEEGADMV MVKPGMPYLD IVFRVKHTFS VPTFVYQVSG EYAMLRGAAE QGWLDWDKVL
LESLLAFKRA GADAVLTYGA LDAARLLRR
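
The record lists translation table 11, and the protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following Python sketch builds a codon table from the compact amino-acid string commonly used for the standard genetic code. Table 11 assigns the same amino acids to codons as the standard table and differs only in the permitted start codons, which this sketch does not model. The `translate` helper is hypothetical, assumes the input starts in frame, and stops at the first stop codon.

```python
from itertools import product

# Codons enumerated in TCAG order, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is ignored; any base other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTCCAGCT ATCCCCATAC CCGTATGCGG"))  # MSSYPHTRMR
```

The output matches the first ten residues of the protein sequence shown in this record.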