Gene Hoch_6440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_6440 
Symbol 
ID8548855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp8835720 
End bp8839382 
Gene Length3663 bp 
Protein Length1220 aa 
Translation table11 
GC content72% 
IMG OID646391101 
Product5-oxoprolinase (ATP-hydrolyzing) 
Protein accessionYP_003270802 
Protein GI262199593 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCAC GCTGGCAGTT CTGGATCGAC CGCGGCGGCA CCTTCACCGA CTGCCTGGGC 
CGCGATCCCG ACACCGGCGC GCTGCACACG GCCAAAGTGC TGTCCTCGGA TCGCGCGCCG
ATCGTGGGCA TCCGCGAGAT CCTCACGCGC CACGCCGGGC TCGGCGAGGA CGAGCCCATC
TGGCCCTGCG AAGTGCGCAT GGGCACCACC ATCGCGACCA ACGCGCTGCT CGAGCGCCGC
GGCACGCCGT GCGCGCTGCT GATCACGCGC GGCTTTGGCG ACCTGCTGGC CATCGGCAAC
CAGACCAGAC CCGAGATTTT TGCGCTGCAC ATCGAGAAGC TGCCCATGCT CTACGACGCC
GTGCTCGAGA TCGACGCGCG CGTCGACGAG CGCGGCGAGG TGCTGGCGCG GCCCGACGCC
CAGGATCTGC GGCAGTCGCT CAGCGGTCTG CGCGAGCGCG GCATCGACAG CCTGGCCGTG
GTCGTATTGC ACGCCTATCG CGCCGGCGAG CTGGAGCGCG AGATCGGCGA ACAGGCGACC
GCGCTGGGTT TCGACCACGT GTCCCTGTCG CACGAGGTGG CGGCCGAGAT CGGCATGGTC
GGCCGCGGCG ATACCACCGT GGTGGACGCC TACCTCACGC CGCTCATCCG CGACTATCTG
CGCGAGCTTC TGGCCGAGCT GCCCGGCAGC TCGCTGCGCA TCATGCAGTC GAGCGGGGGC
CTCACCGGCG CGGCTCGCTT CCGCGGGCGC GACGCGGTGC TCTCGGGACC GGCCGGCGGC
GTGGTCGCGG CCGCCCATGT GGCGCGCGAG GCCGGCTACG AGCGCGCCAT CGGCTTCGAC
ATGGGCGGCA CCTCCACCGA CGTGTCGTGC TACGACGGTG ACTTCGAACG ACAGTACGAA
AACGAGGTCG CCGGCGTGCG GCTGCGCGCG CCGATGATGG CCATCCACAC GGTCGCGGCC
GGCGGCGGCT CGCTGTGCGC GTATCGCGGC TTTCGCCTCA CCGTCGGCCC GGAGAGCGCA
GGCGCCGTGC CCGGTCCGCT GTGCTACGGA CACGAGGATG CGCGCGCCCT GGCGCTCACC
GATATCAACC TCACGCTCGG ACGCGTGGTC GACGACCGCT TTCCCTTCCC GCTGGCGCGC
GAGCGCGTGG ACGCAGCCCT CGATGCGCTG CTCGGCGAGC TGCCGGCCGA GCCCGCCTAC
GATCGCGAAT CGCTCGCGGC CGGCTTCTTC GCCATCGCCA ACGCCAGCAT GGCCGAGGCC
ATCCGCCAGG TGTCGGTGGC CAAGGGCCGC GACGTGCGCG AGTACGCACT GGTGGTCTTC
GGCGGTGCCG GCGGCCAGCA CGCCTGCCCC ATCGCCCGCC AGCTCGGCAT CCGCACCCTG
GTGTTTCATC GCTTCGCCGG CGTGCTCTCG GCCTACGGCA TGGGCCTCGC CGACGTGAGC
TGGCACGGCG AGCGCGACGC CGGCCGCGCA CAGCTCGACG ACGACCTGCC GGCCGCGCTC
GCCGACGACT ACCGCGCGCT GCTCGACGAG GGCCGGCGCG TGCTCGCCGA CGAGGGCTTT
GCCGAAGTGC GCGGCCGGCG GCGGCTGGAC CTGCGCTACC GCGGCACCGA CAGCGCGCTG
CCCGTGGATG TCGATCCCGC CGACGCCGAC ACCGACACCG ACACCGACAC CGACGCCGGC
GCCGCGTTCC TGCCCAGCGA AGGCGCCGTG GAGTACGCGC TATCGTTCGA CGCCACCGCC
CTGCGCGCGG CCTTTGAGCG CGCGCACGAG CAGCGCTTCG GCTACATTCG CCCCGGTCAC
CCGGTCGAGG CCATGGCCGT GCGCGTCGAT GTCGCGGGGC GAAACAGTAC AGACGTCCGA
AGTCGTGCGA CGCACGCTGG CGGGAGCGCC CCGCTGCCGG CGCCGCGACG CCGCGCGCGC
ATGTGGAGCA GCGGCGCCAT GGCCGACGAG GTGCCCGTGT ACGCGCGCGA AGATCTACCG
GTGGGCGCGC GTCTGCGCGG GCCGGCGCTG GTGCTCGACG ACACCGGGAC CATCGCCGTA
GACCGCGGCT TCACCCTCGA GGTGGCCGCA GCCGATCGCG TCGAAGTGCG CGATGAGCAG
CCCGAGGTGG CCGCTGCCGC TGGCGACCAC ACCCAGGTCG ACCCGGTGCG GCTCGAGATC
TTCAACAACG TGTTCATGTC GATCGCCACG CAGATGGGCG AGGTGCTGCG GCGCACGGCG
CTGTCGACCA ACATCCGCGA GCGCCTCGAC TTCTCGTGTG CGGTGTTCGA CGCCGACGGT
GGGCTGGTCG CCAACGCGCC GCACATCCCC GTGCATCTCG GCGCCATGGG CGAGTCGATC
AAGGGCGTGC TCGCCGTGCA TCCCGCGCCC GCGCCCGGCT CGGTGTTCGC GATCAACGAC
CCCGCCGCTG GCGGCTCGCA TCTGCCCGAT GTCACCGTGG TGACGCCGGT CCACGACGGC
GACGGCCGGT TGGCTTTTTT CACCGCCAGC CGCGGGCATC ACTCGGATAT CGGCGGCATC
ACGCCGGGAT CGATGCCACC CTTTTCCACC CGATTGTCCG AAGAAGGCGC CGTATTCCGC
GCCCTGGCGA TCGTCGTGGA TGGCGACTTT CGCGAGCGCG AGGTGCTCGG CGTGCTCGAG
GCCGGTCCGC ACCCGGCCCG CGATCCCGGC CAGAACATCG CCGACCTGCA GGCCCAGGTG
GCCGCCAATC GCGCCGGCGC GCATCTGCTC GCCGAGCTGG TGCAGCGCTA CGGCCGGGCT
ACGGTCCACG CCTACATGGG CCACGTCCAG GACAACGCGG CCGCCCAGGT GGCCGCCGCC
ATCGCGGCAT TGCCCGACGG CGTCCATCGC TTCGAGGACG CGCTCGACGA GGGCGCGCGT
ATCGCGGTGG CCGTCCACGT GGACGGAAAC CGCCTGCGCG TGGACTTTGC CGGCACCAGC
GAGCAGCTCG AGAGCAACCT CAACGCGCCC CGGGCGGTGA CCGTGGCCGC ATTGCTGTAC
GTATTGCGCG CGCTGGTCGG CGTGCCAATT CCGCTCAACA GCGGCTGTTT GCGCGCGGTG
ACGCTGGCGA TTCCCGCGGG TTCGCTGCTG GCGCCCGAGC CCGATCGCGC CGTGGCCGGC
GGCAACGTCG AGACCTCGCA GCGCGTGGTC GACGTGCTGC TCGGCGCGCT CGGGCAAGCG
GCCGCGAGCC AGGGGACGAT GAACAACCTC ACCTTCGGCA ACGAGCGCTT CGGCTACTAC
GAGACCATCG CCGGCGGCGC CGGAGCCACG GCTCAGGGCG CCGGTGCATC GGGCGTGCAC
ACGCACATGA CCAACACCCG CATCACCGAC CCCGAAGTGC TCGAGACGCG CTTTCCCGTG
CGTCTGCTGC GCTTTTCGCT GCGACCCGGC TCGGGCGGCG CGGGCCGGCA CCGCGGCGGC
GACGGCGTGA TCCGCGAGTA TCTGCTGCGC GCGCCCATGC GCGTGTCGAT CCTGAGCGAG
CGGCGCACGC GCCAGCCCTT CGGCCTCGCC GGCGGCCAGC CCGGAGCCGC CGGTCGCAAC
CTGCTCAACG GCGAAGCCCT GCCGGCCAAG GCCAGCGTCG ACGCCGCAGC CGGCGACGTG
CTGTGCATCG AGACGCCGGG CGGCGGCGGC TTCGGCGCGC TCGCTGACGA CGAAACCACC
TGA
 
Protein sequence
MSARWQFWID RGGTFTDCLG RDPDTGALHT AKVLSSDRAP IVGIREILTR HAGLGEDEPI 
WPCEVRMGTT IATNALLERR GTPCALLITR GFGDLLAIGN QTRPEIFALH IEKLPMLYDA
VLEIDARVDE RGEVLARPDA QDLRQSLSGL RERGIDSLAV VVLHAYRAGE LEREIGEQAT
ALGFDHVSLS HEVAAEIGMV GRGDTTVVDA YLTPLIRDYL RELLAELPGS SLRIMQSSGG
LTGAARFRGR DAVLSGPAGG VVAAAHVARE AGYERAIGFD MGGTSTDVSC YDGDFERQYE
NEVAGVRLRA PMMAIHTVAA GGGSLCAYRG FRLTVGPESA GAVPGPLCYG HEDARALALT
DINLTLGRVV DDRFPFPLAR ERVDAALDAL LGELPAEPAY DRESLAAGFF AIANASMAEA
IRQVSVAKGR DVREYALVVF GGAGGQHACP IARQLGIRTL VFHRFAGVLS AYGMGLADVS
WHGERDAGRA QLDDDLPAAL ADDYRALLDE GRRVLADEGF AEVRGRRRLD LRYRGTDSAL
PVDVDPADAD TDTDTDTDAG AAFLPSEGAV EYALSFDATA LRAAFERAHE QRFGYIRPGH
PVEAMAVRVD VAGRNSTDVR SRATHAGGSA PLPAPRRRAR MWSSGAMADE VPVYAREDLP
VGARLRGPAL VLDDTGTIAV DRGFTLEVAA ADRVEVRDEQ PEVAAAAGDH TQVDPVRLEI
FNNVFMSIAT QMGEVLRRTA LSTNIRERLD FSCAVFDADG GLVANAPHIP VHLGAMGESI
KGVLAVHPAP APGSVFAIND PAAGGSHLPD VTVVTPVHDG DGRLAFFTAS RGHHSDIGGI
TPGSMPPFST RLSEEGAVFR ALAIVVDGDF REREVLGVLE AGPHPARDPG QNIADLQAQV
AANRAGAHLL AELVQRYGRA TVHAYMGHVQ DNAAAQVAAA IAALPDGVHR FEDALDEGAR
IAVAVHVDGN RLRVDFAGTS EQLESNLNAP RAVTVAALLY VLRALVGVPI PLNSGCLRAV
TLAIPAGSLL APEPDRAVAG GNVETSQRVV DVLLGALGQA AASQGTMNNL TFGNERFGYY
ETIAGGAGAT AQGAGASGVH THMTNTRITD PEVLETRFPV RLLRFSLRPG SGGAGRHRGG
DGVIREYLLR APMRVSILSE RRTRQPFGLA GGQPGAAGRN LLNGEALPAK ASVDAAAGDV
LCIETPGGGG FGALADDETT