Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_20111 |
Symbol | |
ID | 4778081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1768617 |
End bp | 1772291 |
Gene Length | 3675 bp |
Protein Length | 1224 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640087525 |
Product | hydantoinase/oxoprolinase:hydantoinase B/oxoprolinase |
Protein accession | YP_001018018 |
Protein GI | 124023711 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit [COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTGGC AGTTCTGGAT CGATCGCGGC GGAACCTTCA CAGATCTTGT CGGAATCAAT CCCGCTGGAG AATGCATTGT CCGCAAGGTG CTCTCGGAAC AGCCAGACCA GCCTGGTGAT CCAGCCGTCC GAGCGATCCG GGAGGTGTTG GAGCTCAAAG CAGGACAACC AATTCCCATC GGATTGATTG AAGAAGTGCG CCTCGGCACC ACCGTGGCCA CCAATGCCTT GCTTGAAAAT GCAGGAGAAG CGGTTCTTCT CTTCTGCAAT CGAGGTTTCA AAGATCTGCT CCGCATCGGC GATCAGCATC GGCCTGAATT ATTTGCTTTA CAGATCCGCC GCACGCCCTT CCTGGCACGA GCCGTGATCG AGGTGCCAGG CCGCCTCAAT GCCAAGGGGC AAGAAATCGA ACCGATCAGC TTCGATGCAG CCCTCGAAAA TGAAGTGCGG CGGCATGCCA AGGCAGGCTT GAAAAGCTGC GCCATTGCTC TCCTACATGC CTACCGCAAC CCCGAACACG AACTACACTT GCAGGACTGG TTAAATCAGC TGGGATTTAA CTCCGTGGTT TGCTCCCATC AGGTTTGCCC CCTGCCACGC CTGGTGCCCA GAGGACAAAC CACCCTGGTG GAAGCGACTG TTTCACCAGT ACTTTTCAAG TATCTAAATC AAGTCCGCAA GGAGATCGGA GCCTCAACAC GTCTACGCAT GATGGGCTCA AGCGGTGCAT TGCTGACACC TAAATGGTTA CTCGCTAAAG ACACAATTCT CTCCGGACCG GCAGGGGGGA TGGTTGGTGC TGTCGCCGCA GCACGAGCAT CAGGCCTCGC CCAACAACCT CTACTGGGCT TCGACATGGG TGGAACCTCC ACCGATGTGT TTCATGTTCC TGCCGGGCAG CAGGAGGAAG ACTGGCAGCG CAGCCCTGAA ACAGAGGTGG CCGGCCAGCG CTTAATGGCG CAGAGGTTAC CAATCCACAC CGTGGCAGCA GGGGGTGGAT CAATCATCAG CAGTGATGGC GAACGGCTAC AGGTTGGACC TCGCTCAGCT GGTGCTGACC CCGGACCAGC CTGTTACCGC CGTGGCGGCC CACTGACGAT CACGGACGCC AATCTCTTGC TGGGTCGCCT ACAGGTGAAC GAATTTCCAG CGCTATTCGG CCCATCGGCT AACCAGCCCC CAGACTTATC AGTTGTGCAA AAGCGCTTTA AGCAACTAGC GGAGACGATC GGCAGCACAC CAGAAGACAG CGCAGAAGGT GCCTTAGCAA TCGCTATCGA ACGCATGGCT GATGCAATCC GACAGGTCTC GTTACTACGC GGTCATGACA TCCGTGGAGG CGTGCTTGTG GCCTTCGGAG GGGCCAGTGG GCAACATGCC TGCCGGCTAG CAGCTCAGCT AGGACTGAAG CGAGTGCTGC TTCACCCCTT AGCAGGTGTG CTCTCAGCCC ATGGGATGGG CCATGCCCGT CAACGTCAAT TACGCGAAAG GTCTGTGCGT GAACCCCTTA ACGAAGACTT ATTGGACAAA CTTCAGCAAC TCATCAAGCT GGAACAGACG CAAGCAGAAC AACTCCTACA GGAATCAGGA GACCTTGCAA GTGCTGTTGA TTCAGCTCCA CCAAAGCGTT GGGCCCGCAT CGAACTGCGC TACGCATCCA GCGAACAAGG CTTGACGCTC TCCCTGAAGC CAACAACTTG CATCACGGAT ATACAAAAAG CATTCGCGGT CGCCCACCAA CAGCGCTTTA GCTATATCCC TCCTCACAAT CAGCCTTTAG TGGTGGAAAG GCTCGAAGTC GCAGTAGTAG CCCCCGCATC CCAAAGCGAT CAAAGTCGAT CAAGACGAGG TGACGTCCAA CTGTATACGC CTCCACCACG GAGCGAACAT CAACATGCTG AGGTGCATTG GCCAGATCTT GGCTGGCAGC AGGTGCCCCT CCATCATCGC GACCGTCTGA TAGCAGGGTC TGTACTAGAA GGGCCAGCGC TGATCTTGGA GGCTACAGGC TGCATCGTGC TAGAGCCCGG TTGGCGAGCA AGCGTGGATC AACAAGGAGC TCTTGTACTC GATGCCATCG CTGCAGATTC AATGATTACC AAACACCCTG TAACACTCGT CAAGCAAACG CCAGATCCAG TGTTACTGGA GTTATTCCAT CATCGCTTCA TGGCCATCGC CGAACAGATG GGGGAACGAC TACGTCAAAC CAGTCGTTCA GTGAACATCC GCGAACGACT GGATTTCTCT TGCGCCTTAT TCGATCATCA GGGTGCACTC GTCGCCAATG CCCCTCACAT TCCGGTTCAC CTTGGCTCGA TGGGTGAATC AGTCGCCGAT CTTCTGGCAC AAATTAACGC TGGCGAACGC GGACCACTGC GCCCAGGCGA GACAGTACTC AGCAACGATC CCTACCACGG CGGCACCCAT CTGCCAGACA TCACAGCGAT CACACCTGTG TTCACCACAA GCGACAAACC AAGCTATTTC GTCGCCTGTC GTGCGCATCA TGCCGATGTG GGTGGACTCA CGCCCGGTTC GATGCCGCCC TTTAGTCGCA GCATCAAAGA CGAAGGACTC CTCCTCCGTA ACGTGTCTTT TGTGATCGAT GGTCACCACG ACCGCAAGAG CTGGGAGCAA AGGCTTCACA GCGGCAACAT GCCTCCACGA AACCCAGCCG AATTGCTCGC CGATCTACAA GCGCAAGTCG CCGCCAACCA GTTAGGCGTT CAAGAGCTGA CGGCTCTTGT CGCCAGCACA GGTGATCGAC AAGTCAACAG ATACATGGCC TATGTACAGG CCAATGCGGC CGAAGCAGTG CGCAAGGTGA TCCAAACATT GAACAATCGC GCCTTCTCAG TAGAGCTCGA CAATGGCGCA AAGCTTTGCC TGAAGATCTC AATTGATAAG CATCAGCGAA CAGCAAAGGT TGATTTCACT GGCACCTCAG CCCAGCGCTC TGACAATTTC CAAGCTCCGC TGGCCGTAAC AAAAGCAGCG GTGCTTTATG TCTTCCGCTG TTTAGTGAAG GAGACGATCC CACTCAACGC CGGTTGCTTT GAACCGCTTG AACTGATCGT TCCCAATGGC TGCTTGCTCA ACCCGCACCC ACCTGCAGCA GTCGTAGCAG GAAATGTGGA AACCTCCCAA GCACTCTGCA ATCTATTGTT CGCTGCCCTA GGGGTTATGG CCGCAAGCCA GGGCACGATG AATAATCTCA GCTTCGGCGA CAGCGACCAT CAGTATTACG AAACGGTTGG CGGCGGCAGC GGAGCTGGCA AAGGGTTTGA TGGTGCTGAT GGCATACAGA CGCATATGAC CAATTCCCGC CTCACGGATC CAGAGATCCT TGAGCAGCGC TATCCAGTAC GGTTGGAGCT CTTTGCGTTA AGGCATGGTA GTGGCGGCCT TGGGCGATGG CGTGGTGGTG ATGGGTTGTT GCGACAATTT CGCTTCCTAG CGCCAATGAC AGCGTCGATT CTCTCTGGAT CCAGACGGAT TGCACCGTTC GGGCTATCAG GCGGCCTACC GGGGGCGTTA GGAGCAAACC AACTTGAACA CGTCAATGGA AAAAGAGAGC CACTCAAAGG ATGCGCAACG ATCAATATCG AATCCGGAGA GGCGTTGCTG ATCTGCACCC CAGGCGGTGG AGGTTACGGC AGACCGCGTG ACTAA
|
Protein sequence | MPWQFWIDRG GTFTDLVGIN PAGECIVRKV LSEQPDQPGD PAVRAIREVL ELKAGQPIPI GLIEEVRLGT TVATNALLEN AGEAVLLFCN RGFKDLLRIG DQHRPELFAL QIRRTPFLAR AVIEVPGRLN AKGQEIEPIS FDAALENEVR RHAKAGLKSC AIALLHAYRN PEHELHLQDW LNQLGFNSVV CSHQVCPLPR LVPRGQTTLV EATVSPVLFK YLNQVRKEIG ASTRLRMMGS SGALLTPKWL LAKDTILSGP AGGMVGAVAA ARASGLAQQP LLGFDMGGTS TDVFHVPAGQ QEEDWQRSPE TEVAGQRLMA QRLPIHTVAA GGGSIISSDG ERLQVGPRSA GADPGPACYR RGGPLTITDA NLLLGRLQVN EFPALFGPSA NQPPDLSVVQ KRFKQLAETI GSTPEDSAEG ALAIAIERMA DAIRQVSLLR GHDIRGGVLV AFGGASGQHA CRLAAQLGLK RVLLHPLAGV LSAHGMGHAR QRQLRERSVR EPLNEDLLDK LQQLIKLEQT QAEQLLQESG DLASAVDSAP PKRWARIELR YASSEQGLTL SLKPTTCITD IQKAFAVAHQ QRFSYIPPHN QPLVVERLEV AVVAPASQSD QSRSRRGDVQ LYTPPPRSEH QHAEVHWPDL GWQQVPLHHR DRLIAGSVLE GPALILEATG CIVLEPGWRA SVDQQGALVL DAIAADSMIT KHPVTLVKQT PDPVLLELFH HRFMAIAEQM GERLRQTSRS VNIRERLDFS CALFDHQGAL VANAPHIPVH LGSMGESVAD LLAQINAGER GPLRPGETVL SNDPYHGGTH LPDITAITPV FTTSDKPSYF VACRAHHADV GGLTPGSMPP FSRSIKDEGL LLRNVSFVID GHHDRKSWEQ RLHSGNMPPR NPAELLADLQ AQVAANQLGV QELTALVAST GDRQVNRYMA YVQANAAEAV RKVIQTLNNR AFSVELDNGA KLCLKISIDK HQRTAKVDFT GTSAQRSDNF QAPLAVTKAA VLYVFRCLVK ETIPLNAGCF EPLELIVPNG CLLNPHPPAA VVAGNVETSQ ALCNLLFAAL GVMAASQGTM NNLSFGDSDH QYYETVGGGS GAGKGFDGAD GIQTHMTNSR LTDPEILEQR YPVRLELFAL RHGSGGLGRW RGGDGLLRQF RFLAPMTASI LSGSRRIAPF GLSGGLPGAL GANQLEHVNG KREPLKGCAT INIESGEALL ICTPGGGGYG RPRD
|
| |