Gene Information |
Locus tag | Hhal_0734 |
Symbol | |
ID | 4711399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 820192 |
End bp | 823833 |
Gene Length | 3642 bp |
Protein Length | 1213 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 639855198 |
Product | 5-oxoprolinase (ATP-hydrolyzing) |
Protein accession | YP_001002318 |
Protein GI | 121997531 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit |
TIGRFAM ID | |
|
|
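The coordinate and length fields above are related by simple arithmetic, sketched below. This is a sanity check written for illustration, not part of the source database. It assumes 1-based inclusive coordinates and that the gene length includes the stop codon. The function name `check_gene_record` is hypothetical.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Check that coordinates, gene length and protein length agree.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the gene length counts the stop codon.
    """
    span_bp = end_bp - start_bp + 1                # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)  # whole codons in the CDS
    return {
        "span_bp": span_bp,
        "span_matches_length": span_bp == gene_length_bp,
        "whole_codons": remainder == 0,
        # one codon is the stop codon, which is not part of the protein
        "protein_matches": codons - 1 == protein_length_aa,
    }


# Values taken from the Hhal_0734 record
result = check_gene_record(820192, 823833, 3642, 1213)
print(result)
```

For this record, 823833 - 820192 + 1 = 3642 bp, and 3642 / 3 = 1214 codons, which is 1213 amino acids plus one stop codon.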
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGACG GCTTCGCGGC GGGAGAGCGC TGGCAGTTCT GGATCGATCG GGGCGGTACC TTCACCGACG TCATCGCCCG CGCCCCCGAC GGGCGCCTGA TCGCCCGCAA GTTCCTCTCC GAGAACCCGG AGCAGTACAC CGACGCCGCC CTCCACGGCA TCCGCACCAT CCTCGGTGTG GGGGCCGACG CCCCGATCCC GGCCGAGCGG ATCGAGGCGG TGCGCATGGG GACCACGGTG GCGACCAACG CGCTGCTCGA GCGCCGCGGC GAGCCCACGG TCCTCGCCAT CACCGAGGGC TTGGCCGATC AGCTGCGCAT CGGCTACCAG CACCGCCCCG ATATCTTTGA CCGGCGCGTG CGCCTGCCGC AGATGCTCTA TTCCGAGGTC ATCGAGATCC CCGAGCGGCT CGGCGCCGAC GGCTCGGTGG TCCGCACGCT GGATGCCGAC GCCGTGCGCG CCCGGCTCGA GCACACCTAC GCCGCCGGCT ACCGGGCGCT GGCGGTGGTG CTCATGCACG CCTGGCGCGA CGCCGCCCAC GAGCAGGTGG TGGCCCGCAT CGCCCGCGAG GTGGGCTTCA CCCAGGTGAC GACCAGTGCC GAGGCCGCCG CCGTGATGAA GATCGTCGGC CGGGGGGATA CTGCCGTGGT CGACGCCTAC CTGTCGCCGG TGCTGCGGCG GTACGTCGAG CGCCTGGCCG CCGAGCTCGG CGATGTGCCG CTGCTGTTCA TGCAGTCCAA CGGCGGGCTG ACCAGCGCCG CGCAGTTCCA GGGCAAGGAC GCCATCCTCT CCGGCCCCGC CGGCGGGATC GTCGGCGCCG TGCGCACCGC GGCCATGGCG GGCATCGACC GGCTGATCAG CTTCGACATG GGCGGGACCT CCACCGACGT GGCCCACTAC GACGGCGAGT TCGAGCGCAC CTTCGAGGCG GAGATCGCCG GCTGCCGCAT CCGCGCGCCG ATGATGCAGA TCCACACCGT CGCTGCCGGG GGCGGCTCGA TCTGCCACTT CGACGGCATG AAGTACCGGG TCGGGCCCGA CTCCGCCGGC GCCGATCCGG GGCCGGCGGC CTACCGCCGC GGCGGGCCGC TGACCGTCAC CGACTGCAAT GTCCTGCTCG GGCTGATCCG CCCGGAGTTC TTCCCGCGCC TCTTCGGCCC CGGTGCCGAT CAGCCCCTGG ACGCCGAGGG GGTGCACGAG CGCTTCGCCG AGCTGGCCGA GCGCATCCAC GCCGAGACCG GGGATGCCCG GGAGCCGGTG GCGGTGGCCG CCGGCTTTCG GCGTATCGCC GTGGAGAACA TGGCCCAGGC GATCAAGCGT ATCTCGGTGC AGCGCGGCTA CGACGTCACC CGCTACGCCC TCAACTGCTT CGGTGGCGCC GGGGGGCAGC ACGCCTGTGC CGTGGCCGAT GTGCTGGGCA TCCGCACGGT GTTCGTCCAC CCGCTGGCGG GGGTGCTCTC GGCCTACGGC ATGGGGCTGG CGGACATCAC CGCCATCAGT CAGCGCACCG TGGAGGCGCC CCTGGAGCCG GCCAGCGCCC CGCAGCTGGC CGACGTGATC GACGAGCTCG CCGCCGAGGC CCGCGCCGGG CTGGCGACCC AGGGGCTGGC GGGCGAGGCG GCGACGCTGC GCGTGCGGGC CCACGTCCGC TACGCCGGCA CCGATACCGC GCTCGAGGTC CCCGGCAGCG ACGACGTCGC CGAGGTCGAC GCGGCCTTTG CCCGGCTGCA CCGCCAGCGC TTCGGCTTCA CCCTCGATGA CCGCCCGCGG GTCCTCGAGG CGCTCAGCGT CGAGGCGATT 
CACCACGCCG CGGCGGAGGA AGCGGGCGAG GGCGAGGCGC CGGCGCCGAA GGCCCCGCCG CAGCCGCTGG CCCGGGTCAC GGCCTGGGAT GGCCAGCGAA TGGCGGAGCA GCCGGTCTAC GCCCGCGCCG ATCTGCTCCC GGGCACCCGG CTGAGCGGGC CGGCGATCCT CCAGGAGGAG AACGCCACCA CGGTGATCGA CGCCGGCTGG GAGGCGGAGG TGACCGGCGG CGATCAGCTG ATCCTGCGCC GGTCGGTGAC CGCCGAGGCG GGGCAGCCGG TGCAGACACC GCAGGTGGAT ACCCGGCGCC CCGACCCGGT GCTGCTGGAG GTGTTCAACA ACCTGTTCCG CTCGATTGCC GAGCAGATGG GCACGACCCT GGCCGGGACG GCGCAGTCGG TGAACATCAA GGAGCGGCTC GACTTCTCCT GTGCGCTCTT CGACGCCGAC GGCAACCTGG TGGCCAACGC GCCGCACATC CCGGTCCACC TCGGCTCGAT GTCGGAGGCG GTGCGCACCA TCCTCCATCG CCGCGGCGAG ACCCTGCGCC CCGGCGACGT CTTCCTGCTC AACGATCCCT ACAATGGCGG CACGCACCTG CCCGATCTCA CCGCGGTGAC GCCGGTCTTC TCCGCGGACG GGCAGGAGCT GCTCTTCTTC TGCGCCAGCC GTGGCCACCA CGCCGATGTC GGTGGCCGTA CCCCGGGATC GATGCCGGCG GACTCCACCC GGGTGACCGA GGAGGGGGTG CTCATCAACG ACCTCCAGGT CGTCGCCGAG GGGCGTTTCC TCGAGGAGGC GTTCACCACC GTGATGAGCG GTGGCCCGTA TCCGGCGCGC AACGTGGCGC AGAACATCGC CGATCTCAAG GCGCAGATCG CCGCCAACGA GAAGGGCGTC GCCGAGCTGC GGCGCATGGT GGCGCAGTTC GGCCTGGCGG TGGTGCAGGC CTACATGGGC TTCGTCCAGG AGAACGCCGC CGAGCACGTG CGCCGGGTCA TCGATCGCCT CGCCGACGGC GAGTTCACCG GTGATCTGGA TAACGGCGCG CGGATCCGGG TCTCGGTCCG CGTCGATCAC GCCGCGCGCC GGGCGCGGAT CGACTTCAGC GGCACCTCCG AGCAGCTGGC GGAGAGCAAC TTCAACGCCC CGCTGGCGAT CACCCGGGCG GCCACGCTGT ACGTCTTCCG CACCCTGGTC GAGGACGATA TCCCGCTCAA CGAAGGCTGC CTGGAGCCGC TGGAGATCGT CGTGCCGGAG GGTTCGATGC TCAACCCGCG CTACCCGGCC GCCGTGGTCG CCGGCAATGT GGAGACCTCC CAGGCGGTGA CCGACGCCCT CTACGGGGCG CTGCAGGCGA TGGCTGCCAG CCAGGGCACC ATGAACAACC TGACCTTCGG CAACCAGCGC TACCAGTACT ACGAGACCCT CTGCGGCGGC GCCGGCGCCG GGCCGGACTT CGCCGGCAGC TCTGCGGTCC ACACCCACAT GACCAACTCG CGCCTAACCG ATCCGGAGGT GCTGGAGTGG CGCTATCCGG TGCGCGTCGA GCGCTTCGCC ATCCGTCGGG GCAGCGGCGG GGGCGGGGCG TGCCCGGGCG GCGACGGGGT CATCCGGCGC CTGCGCTTCC TGGAGCCGAT GACCGCGGTG ACGCTGATGA ACCGCCGCCG CGTACCGCCC TTCGGGCTGG CCGGCGGCGC GGATGCGGCG TGCGGGCGCA ACGCCATCGA GCGCCGGGAC GGCACCGTCG AAGAGCTGCC GGGCACCGCC ACCCGGGATC TCGAGGCGGG GGATCAGATC CTCATCGAGA 
CCCCGGGCGG GGGCGGTTAC GGGGCCGGCT GA
|
Protein sequence | MTDGFAAGER WQFWIDRGGT FTDVIARAPD GRLIARKFLS ENPEQYTDAA LHGIRTILGV GADAPIPAER IEAVRMGTTV ATNALLERRG EPTVLAITEG LADQLRIGYQ HRPDIFDRRV RLPQMLYSEV IEIPERLGAD GSVVRTLDAD AVRARLEHTY AAGYRALAVV LMHAWRDAAH EQVVARIARE VGFTQVTTSA EAAAVMKIVG RGDTAVVDAY LSPVLRRYVE RLAAELGDVP LLFMQSNGGL TSAAQFQGKD AILSGPAGGI VGAVRTAAMA GIDRLISFDM GGTSTDVAHY DGEFERTFEA EIAGCRIRAP MMQIHTVAAG GGSICHFDGM KYRVGPDSAG ADPGPAAYRR GGPLTVTDCN VLLGLIRPEF FPRLFGPGAD QPLDAEGVHE RFAELAERIH AETGDAREPV AVAAGFRRIA VENMAQAIKR ISVQRGYDVT RYALNCFGGA GGQHACAVAD VLGIRTVFVH PLAGVLSAYG MGLADITAIS QRTVEAPLEP ASAPQLADVI DELAAEARAG LATQGLAGEA ATLRVRAHVR YAGTDTALEV PGSDDVAEVD AAFARLHRQR FGFTLDDRPR VLEALSVEAI HHAAAEEAGE GEAPAPKAPP QPLARVTAWD GQRMAEQPVY ARADLLPGTR LSGPAILQEE NATTVIDAGW EAEVTGGDQL ILRRSVTAEA GQPVQTPQVD TRRPDPVLLE VFNNLFRSIA EQMGTTLAGT AQSVNIKERL DFSCALFDAD GNLVANAPHI PVHLGSMSEA VRTILHRRGE TLRPGDVFLL NDPYNGGTHL PDLTAVTPVF SADGQELLFF CASRGHHADV GGRTPGSMPA DSTRVTEEGV LINDLQVVAE GRFLEEAFTT VMSGGPYPAR NVAQNIADLK AQIAANEKGV AELRRMVAQF GLAVVQAYMG FVQENAAEHV RRVIDRLADG EFTGDLDNGA RIRVSVRVDH AARRARIDFS GTSEQLAESN FNAPLAITRA ATLYVFRTLV EDDIPLNEGC LEPLEIVVPE GSMLNPRYPA AVVAGNVETS QAVTDALYGA LQAMAASQGT MNNLTFGNQR YQYYETLCGG AGAGPDFAGS SAVHTHMTNS RLTDPEVLEW RYPVRVERFA IRRGSGGGGA CPGGDGVIRR LRFLEPMTAV TLMNRRRVPP FGLAGGADAA CGRNAIERRD GTVEELPGTA TRDLEAGDQI LIETPGGGGY GAG
|
| |
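The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. A minimal sketch of that mapping follows, using only the first 30 bases of the gene sequence. The helper name `translate` is hypothetical. The codon-to-amino-acid assignments of table 11 are the same as those of the standard code (the tables differ in permitted start codons), and this gene starts with ATG, so no special start handling is included.

```python
# Codon table in the conventional TCAG ordering; the amino-acid
# assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above
first_block = "ATGACGGACG GCTTCGCGGC GGGAGAGCGC"
print(translate(first_block))  # MTDGFAAGER
```

The output matches the first ten residues of the protein sequence listed in the record.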