Gene P9303_20111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_20111 
Symbol 
ID4778081 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1768617 
End bp1772291 
Gene Length3675 bp 
Protein Length1224 aa 
Translation table11 
GC content56% 
IMG OID640087525 
Producthydantoinase/oxoprolinase:hydantoinase B/oxoprolinase 
Protein accessionYP_001018018 
Protein GI124023711 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit
[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTGGC AGTTCTGGAT CGATCGCGGC GGAACCTTCA CAGATCTTGT CGGAATCAAT 
CCCGCTGGAG AATGCATTGT CCGCAAGGTG CTCTCGGAAC AGCCAGACCA GCCTGGTGAT
CCAGCCGTCC GAGCGATCCG GGAGGTGTTG GAGCTCAAAG CAGGACAACC AATTCCCATC
GGATTGATTG AAGAAGTGCG CCTCGGCACC ACCGTGGCCA CCAATGCCTT GCTTGAAAAT
GCAGGAGAAG CGGTTCTTCT CTTCTGCAAT CGAGGTTTCA AAGATCTGCT CCGCATCGGC
GATCAGCATC GGCCTGAATT ATTTGCTTTA CAGATCCGCC GCACGCCCTT CCTGGCACGA
GCCGTGATCG AGGTGCCAGG CCGCCTCAAT GCCAAGGGGC AAGAAATCGA ACCGATCAGC
TTCGATGCAG CCCTCGAAAA TGAAGTGCGG CGGCATGCCA AGGCAGGCTT GAAAAGCTGC
GCCATTGCTC TCCTACATGC CTACCGCAAC CCCGAACACG AACTACACTT GCAGGACTGG
TTAAATCAGC TGGGATTTAA CTCCGTGGTT TGCTCCCATC AGGTTTGCCC CCTGCCACGC
CTGGTGCCCA GAGGACAAAC CACCCTGGTG GAAGCGACTG TTTCACCAGT ACTTTTCAAG
TATCTAAATC AAGTCCGCAA GGAGATCGGA GCCTCAACAC GTCTACGCAT GATGGGCTCA
AGCGGTGCAT TGCTGACACC TAAATGGTTA CTCGCTAAAG ACACAATTCT CTCCGGACCG
GCAGGGGGGA TGGTTGGTGC TGTCGCCGCA GCACGAGCAT CAGGCCTCGC CCAACAACCT
CTACTGGGCT TCGACATGGG TGGAACCTCC ACCGATGTGT TTCATGTTCC TGCCGGGCAG
CAGGAGGAAG ACTGGCAGCG CAGCCCTGAA ACAGAGGTGG CCGGCCAGCG CTTAATGGCG
CAGAGGTTAC CAATCCACAC CGTGGCAGCA GGGGGTGGAT CAATCATCAG CAGTGATGGC
GAACGGCTAC AGGTTGGACC TCGCTCAGCT GGTGCTGACC CCGGACCAGC CTGTTACCGC
CGTGGCGGCC CACTGACGAT CACGGACGCC AATCTCTTGC TGGGTCGCCT ACAGGTGAAC
GAATTTCCAG CGCTATTCGG CCCATCGGCT AACCAGCCCC CAGACTTATC AGTTGTGCAA
AAGCGCTTTA AGCAACTAGC GGAGACGATC GGCAGCACAC CAGAAGACAG CGCAGAAGGT
GCCTTAGCAA TCGCTATCGA ACGCATGGCT GATGCAATCC GACAGGTCTC GTTACTACGC
GGTCATGACA TCCGTGGAGG CGTGCTTGTG GCCTTCGGAG GGGCCAGTGG GCAACATGCC
TGCCGGCTAG CAGCTCAGCT AGGACTGAAG CGAGTGCTGC TTCACCCCTT AGCAGGTGTG
CTCTCAGCCC ATGGGATGGG CCATGCCCGT CAACGTCAAT TACGCGAAAG GTCTGTGCGT
GAACCCCTTA ACGAAGACTT ATTGGACAAA CTTCAGCAAC TCATCAAGCT GGAACAGACG
CAAGCAGAAC AACTCCTACA GGAATCAGGA GACCTTGCAA GTGCTGTTGA TTCAGCTCCA
CCAAAGCGTT GGGCCCGCAT CGAACTGCGC TACGCATCCA GCGAACAAGG CTTGACGCTC
TCCCTGAAGC CAACAACTTG CATCACGGAT ATACAAAAAG CATTCGCGGT CGCCCACCAA
CAGCGCTTTA GCTATATCCC TCCTCACAAT CAGCCTTTAG TGGTGGAAAG GCTCGAAGTC
GCAGTAGTAG CCCCCGCATC CCAAAGCGAT CAAAGTCGAT CAAGACGAGG TGACGTCCAA
CTGTATACGC CTCCACCACG GAGCGAACAT CAACATGCTG AGGTGCATTG GCCAGATCTT
GGCTGGCAGC AGGTGCCCCT CCATCATCGC GACCGTCTGA TAGCAGGGTC TGTACTAGAA
GGGCCAGCGC TGATCTTGGA GGCTACAGGC TGCATCGTGC TAGAGCCCGG TTGGCGAGCA
AGCGTGGATC AACAAGGAGC TCTTGTACTC GATGCCATCG CTGCAGATTC AATGATTACC
AAACACCCTG TAACACTCGT CAAGCAAACG CCAGATCCAG TGTTACTGGA GTTATTCCAT
CATCGCTTCA TGGCCATCGC CGAACAGATG GGGGAACGAC TACGTCAAAC CAGTCGTTCA
GTGAACATCC GCGAACGACT GGATTTCTCT TGCGCCTTAT TCGATCATCA GGGTGCACTC
GTCGCCAATG CCCCTCACAT TCCGGTTCAC CTTGGCTCGA TGGGTGAATC AGTCGCCGAT
CTTCTGGCAC AAATTAACGC TGGCGAACGC GGACCACTGC GCCCAGGCGA GACAGTACTC
AGCAACGATC CCTACCACGG CGGCACCCAT CTGCCAGACA TCACAGCGAT CACACCTGTG
TTCACCACAA GCGACAAACC AAGCTATTTC GTCGCCTGTC GTGCGCATCA TGCCGATGTG
GGTGGACTCA CGCCCGGTTC GATGCCGCCC TTTAGTCGCA GCATCAAAGA CGAAGGACTC
CTCCTCCGTA ACGTGTCTTT TGTGATCGAT GGTCACCACG ACCGCAAGAG CTGGGAGCAA
AGGCTTCACA GCGGCAACAT GCCTCCACGA AACCCAGCCG AATTGCTCGC CGATCTACAA
GCGCAAGTCG CCGCCAACCA GTTAGGCGTT CAAGAGCTGA CGGCTCTTGT CGCCAGCACA
GGTGATCGAC AAGTCAACAG ATACATGGCC TATGTACAGG CCAATGCGGC CGAAGCAGTG
CGCAAGGTGA TCCAAACATT GAACAATCGC GCCTTCTCAG TAGAGCTCGA CAATGGCGCA
AAGCTTTGCC TGAAGATCTC AATTGATAAG CATCAGCGAA CAGCAAAGGT TGATTTCACT
GGCACCTCAG CCCAGCGCTC TGACAATTTC CAAGCTCCGC TGGCCGTAAC AAAAGCAGCG
GTGCTTTATG TCTTCCGCTG TTTAGTGAAG GAGACGATCC CACTCAACGC CGGTTGCTTT
GAACCGCTTG AACTGATCGT TCCCAATGGC TGCTTGCTCA ACCCGCACCC ACCTGCAGCA
GTCGTAGCAG GAAATGTGGA AACCTCCCAA GCACTCTGCA ATCTATTGTT CGCTGCCCTA
GGGGTTATGG CCGCAAGCCA GGGCACGATG AATAATCTCA GCTTCGGCGA CAGCGACCAT
CAGTATTACG AAACGGTTGG CGGCGGCAGC GGAGCTGGCA AAGGGTTTGA TGGTGCTGAT
GGCATACAGA CGCATATGAC CAATTCCCGC CTCACGGATC CAGAGATCCT TGAGCAGCGC
TATCCAGTAC GGTTGGAGCT CTTTGCGTTA AGGCATGGTA GTGGCGGCCT TGGGCGATGG
CGTGGTGGTG ATGGGTTGTT GCGACAATTT CGCTTCCTAG CGCCAATGAC AGCGTCGATT
CTCTCTGGAT CCAGACGGAT TGCACCGTTC GGGCTATCAG GCGGCCTACC GGGGGCGTTA
GGAGCAAACC AACTTGAACA CGTCAATGGA AAAAGAGAGC CACTCAAAGG ATGCGCAACG
ATCAATATCG AATCCGGAGA GGCGTTGCTG ATCTGCACCC CAGGCGGTGG AGGTTACGGC
AGACCGCGTG ACTAA
 
Protein sequence
MPWQFWIDRG GTFTDLVGIN PAGECIVRKV LSEQPDQPGD PAVRAIREVL ELKAGQPIPI 
GLIEEVRLGT TVATNALLEN AGEAVLLFCN RGFKDLLRIG DQHRPELFAL QIRRTPFLAR
AVIEVPGRLN AKGQEIEPIS FDAALENEVR RHAKAGLKSC AIALLHAYRN PEHELHLQDW
LNQLGFNSVV CSHQVCPLPR LVPRGQTTLV EATVSPVLFK YLNQVRKEIG ASTRLRMMGS
SGALLTPKWL LAKDTILSGP AGGMVGAVAA ARASGLAQQP LLGFDMGGTS TDVFHVPAGQ
QEEDWQRSPE TEVAGQRLMA QRLPIHTVAA GGGSIISSDG ERLQVGPRSA GADPGPACYR
RGGPLTITDA NLLLGRLQVN EFPALFGPSA NQPPDLSVVQ KRFKQLAETI GSTPEDSAEG
ALAIAIERMA DAIRQVSLLR GHDIRGGVLV AFGGASGQHA CRLAAQLGLK RVLLHPLAGV
LSAHGMGHAR QRQLRERSVR EPLNEDLLDK LQQLIKLEQT QAEQLLQESG DLASAVDSAP
PKRWARIELR YASSEQGLTL SLKPTTCITD IQKAFAVAHQ QRFSYIPPHN QPLVVERLEV
AVVAPASQSD QSRSRRGDVQ LYTPPPRSEH QHAEVHWPDL GWQQVPLHHR DRLIAGSVLE
GPALILEATG CIVLEPGWRA SVDQQGALVL DAIAADSMIT KHPVTLVKQT PDPVLLELFH
HRFMAIAEQM GERLRQTSRS VNIRERLDFS CALFDHQGAL VANAPHIPVH LGSMGESVAD
LLAQINAGER GPLRPGETVL SNDPYHGGTH LPDITAITPV FTTSDKPSYF VACRAHHADV
GGLTPGSMPP FSRSIKDEGL LLRNVSFVID GHHDRKSWEQ RLHSGNMPPR NPAELLADLQ
AQVAANQLGV QELTALVAST GDRQVNRYMA YVQANAAEAV RKVIQTLNNR AFSVELDNGA
KLCLKISIDK HQRTAKVDFT GTSAQRSDNF QAPLAVTKAA VLYVFRCLVK ETIPLNAGCF
EPLELIVPNG CLLNPHPPAA VVAGNVETSQ ALCNLLFAAL GVMAASQGTM NNLSFGDSDH
QYYETVGGGS GAGKGFDGAD GIQTHMTNSR LTDPEILEQR YPVRLELFAL RHGSGGLGRW
RGGDGLLRQF RFLAPMTASI LSGSRRIAPF GLSGGLPGAL GANQLEHVNG KREPLKGCAT
INIESGEALL ICTPGGGGYG RPRD