Gene Hoch_2334 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_2334 
Symbol 
ID8544720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp3243808 
End bp3245175 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content69% 
IMG OID646387038 
Productamidohydrolase 
Protein accessionYP_003266769 
Protein GI262195560 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.202717 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACACT GGCTTTGGTT TCTCCGCATC GCGTCGGTCT GCGCCCTGGT GTGCGCTTGC 
GGGGCTCCGC GCGGGCTTCA CGCCAGCCCG CCGGAGAGCG CCGCCAGCGG CGCCTTGTTG
CCCGGTAGCT CGTTCTTGAT TCACGCCGGC GCGCTCATCG ACGGACGCTC GGACGAGCGC
CGCACCCAGG TCAGCATCAC CGTGCAAGAC GGCAAAATCG CGGCCGTCTC GCCCGGCTTC
GCGCGCCCGG CTCGCGGCCA GCGCGTCATC GACCTGCGCG CCTTCACCGT GCTGCCCGGG
CTGATGGATA TGCACACGCA CCTCTCCGGT GAGCACAGCG ACAAGAGCTA CTCCGAGCGC
TTCTTCATGG ATCCCAGCGA TGTCGCGCTG CGCTCCACGG TGTTCGCCCG CCGCACCCTG
ATGGCCGGGT TCACCACCGT GCGCAACCTC GGCGACAGCC ACAACGTCAC CCGGGCGCTG
CGCGACGCCG TGGCCAAGGG CTGGGTCGTG GGTCCGCGCA TCTTCACCGC CACCAAGTCG
ATCGCCACCA CCGGCGGCCA CGCCGACCCG ACCAACGGCC TCAACGTCGA GCTGCGCGGT
GAGCCCGGGC CCAAGCAGGG CGTCATCAAC AGCCCCGAAG AAGCCCGCGC AGCCGTGCGC
CAGCGCTACA AGGAAGGCGC CGATCTCATC AAGATCACGG CCACAGGCGG CGTGCTCAGC
CTCGCGGCCA GCGGCCAGAA CCCGCAGTTC ACCAGCCTCG AACTCGAGGC CCTGGTGACC
GCGGCCAAGG ACTACGGCTT CACCGTGGCC GTGCACGCGC ACGGCGCCGA GGGCATGCGC
CGCGCTGTAC TCGCGGGCGT GAGCTCGATC GAGCACGGCA CCTACATGGA CGACGAGATC
ATGGCGCTGA TGAAAGCGCG CGGCACCTAC TACGTCCCGA CCATCTCGGC CGGCCGTTGG
GTCGCGGACA AAGCCAAGGA GGACGGCTAT TTCCCCGCTA TCGTGCGCCC CAAAGCCGCC
GCCATCGGCC CGCAGATCCA GGACACCTTC GCGCGCGCCT ACCGCGCCGG CGTCAACATC
GCCTTCGGCA CCGACACCGG GGTCTCGGCC CACGGCGACA ACGCCCGGGA ATTCGTCTAC
ATGGTCGAAG CCGGCATGCC GCCCATGGCC GCGATCCAGT CGGCGACCCG CGAGGCAGCC
AAGCTGCTGC GCATCGACGA TCGCCTGGGC ACGGTCGAAG TCGGCAAGAT CGCCGACCTC
GTCGCGGTGC GCGACAATCC CCTCGAGCGC ATCGAGACCA TGCTCGATGT GGCTTTTGTG
ATGAAGGACG GCCAGGTCTT CAAGCTGCCG GCGACCGCCG AGCCGTGA
 
Protein sequence
MKHWLWFLRI ASVCALVCAC GAPRGLHASP PESAASGALL PGSSFLIHAG ALIDGRSDER 
RTQVSITVQD GKIAAVSPGF ARPARGQRVI DLRAFTVLPG LMDMHTHLSG EHSDKSYSER
FFMDPSDVAL RSTVFARRTL MAGFTTVRNL GDSHNVTRAL RDAVAKGWVV GPRIFTATKS
IATTGGHADP TNGLNVELRG EPGPKQGVIN SPEEARAAVR QRYKEGADLI KITATGGVLS
LAASGQNPQF TSLELEALVT AAKDYGFTVA VHAHGAEGMR RAVLAGVSSI EHGTYMDDEI
MALMKARGTY YVPTISAGRW VADKAKEDGY FPAIVRPKAA AIGPQIQDTF ARAYRAGVNI
AFGTDTGVSA HGDNAREFVY MVEAGMPPMA AIQSATREAA KLLRIDDRLG TVEVGKIADL
VAVRDNPLER IETMLDVAFV MKDGQVFKLP ATAEP