Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_28901 |
Symbol | gcvP |
ID | 4778971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2553582 |
End bp | 2556530 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640088413 |
Product | glycine dehydrogenase |
Protein accession | YP_001018885 |
Protein GI | 124024578 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | [TIGR00461] glycine dehydrogenase (decarboxylating) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.624172 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACCCTGC TTGAACAGCG TGCAGTTGAG GCTTCAAGCA TTCAGTCTTC ACCTTTTTTG GTGAGGCACA TTGGGCCCAG CTTTGAAGAC CAGCAGCAAA TGCTGCTGGA GCTTGGTCAT CGTGATCTTC AGAGCTTTGT TGCTGCTGTA GTTCCTCCAG ACATTCTCGA GACAACGGCA CCCACTAGTT CCCTTCCTGA GGGCTGTGGT GAAGTTCAGG CTATGGAGGA ACTGCGTCTC ATCGCGGCGG CGAATCGGGT GAGGCGTTCT TTAATTGGGC TTGGCTATTA CGGCACAGCA ACGCCTGCGT TGATTCAGCG ACAGGTGCTT GAAAATCCTG CTTGGTATAC CGCTTACACC CCTTATCAGG CTGAGATCTC TCAAGGACGG CTTGAGGCTC TGTTCAACTT TCAGACGCTG ATCAGTGAGC TCACGGGGTT GCCGATCGCC AATGCTTCCT TGCTTGATGA GGGAACGGCA GCTGCTGAGG CGATGAGCCT CAGCTTTGCG ATTTGCAAGC GCCCTCAGGC TCATCGTTTT CTCGTGGATG CGGAGGTTCT TCCCCAGACT CTTGCTGTGT TGAGAACTCG GGCTGAACCT CTGGGCATTC ATCTCGAGGT GGCCGAGCCG ATGACCTTTC AGTTTGATGC TGAGGTCTTT GGTGTTTTGC TTCAGTTGCC TGGACGTAGC GGCAGGCTCT GGGATCCCAC GACCTCCATT CAGGCTGCTC ATGAGGTAGG GGCCCTGGCA ACTGTTGCCA TTGATCCTCT TGCTCAGGTT TTGATTGCTC CAGTGGCTGA ATTTGGAGCT GATATTGCTG TTGGCAGTGT TCAACGGTTT GGGGTGCCAA TGGCCTTTGG GGGGCCTCAC GCCGCCTTTT TTGCCACCCT TGAGATCTTC AAGCGTCAGG TCCCTGGACG TCTAGTCGGC CAGTCTGTGG ATGCTGAGGG GCAACCGGCA TTGCGATTGG CGTTGCAGAC CCGGGAGCAA CACATCCGCC GAGACAAGGC CACCAGCAAT ATCTGCACGG CGCAGGTCTT ATTGGCTGTG ATGGCATCGT TCTATGCAGT TCACCATGGA CCCGATGGGC TCGCTGCAAT CGCTAGACGA GTCCTGCTTT TACGGGCTCA ACTGGAGAGG GGCTTGCACC AGTTGGGCTA TCCAGTCCAG TCGATAGCTC GCTTTGACAC GATTGAGGTG ATCTGTCGCG AAGCTCCTGC TGTTCATCAG GCAGCCGCTT TGGCGGGATT CAACCTGCGG GTGTTGCCTT TGGGTGTTGC TCCAGAGGCA GCCCATGGAT TTGGAATTAG TTTTGATGAG CTCAGCACCG ATCAGGAGCT GAAGTCGATC CTCCAGATTT TGGCTGAGGC TGCAGGTCAA CCTGTGCCAG TTCTCGAAGA CCTTGGGAAC CCTCACCTTG AGGAGCTTGT TGGCTTGCCG TTGCGCCAGC GACCTTGGTT GCAGCAGCAG GTGTTCCATC GCTATCGCAG TGAAACCGAG TTGCTTCGTT ACCTCCAGCG ATTGGTGGGC CGCGATCTTT CATTGGTGCA CGGGATGATC CCCCTGGGCA GTTGCACGAT GAAGCTCAAT GCTGCAGCTG AACTGATCCC CATTAGTTGG AGGGAGTTTG CGGCGCTGCA TCCTTTTGCT CCTCAGGATC AGTGTCATGG GTATCAGAGA TTGGTTCAGG ACCTGGAACA TTGGTTTGCA GAAATTACTG GCTTTGCCGG GGTGTCTTTG CAGCCGAATG CCGGCTCTCA GGGGGAATTA GCTGGCTTGT TGGTGATTCG CGCTTGGCAT CATTCCCGCG GAGAGCAGCA GAGAGATGTT TGTCTGATTC CCACCAGTGC CCATGGCACC AATCCAGCCA CGTGCGTGAT GGCCGGTTTG CGGGTGGTGC CTGTGGCTTG TGATGCAGAG GGGAATGTGG ATCTCAACGA TTTGGCTAGC AAGGCCGAGG CCCATGCTCC TCAGCTGGCA GCCCTGATGG TCACCTATCC CTCAACCCAT GGTGTCTTTG AACCACAGAT TCGTGAAATC TGTGAGCTCG TTCATGGTCA CGGTGGCCAG GTGTATCTCG ATGGCGCCAA TCTCAATGCG CAGATTGGCT TTTGTCGTCC TGGGACTTAT GGCGCTGATG TTTGTCATAT CAATCTGCAC AAGACCTTTT GTATCCCCCA TGGTGGTGGC GGACCAGGGG TGGGTCCGAT CGCAGTGGCT GCTCATTTGA TGCCTTTCCT TCCAGGTCAT CCCCTAGCTG CCTGCGGCGG CGAGCAGGGA ATCGGCGCAA TTTCGGCTGC GCCATGGGGA AGTGCAGGCA TCCTGCCGAT CAGCTGGATG TACCTGCGCA TGATGGGTGC GGAGGGATTG CGGCAGGCCA GTGCTGTTGC ACTGTTATCG GCTAATTACC TAGCCCATCG GTTGCATCCT CATTACCCAG TGCTGTTTCG GGGTCAAGCA GGGTTGGTTG CCCATGAATG CATCCTCGAC TTGCGCCCCC TTAAGCGAAG TGCAGGCTTG GAGGTGGATG ACATTGCAAA GCGGTTGATG GACTATGGCT TTCATGCCCC AACCGTGAGC TGGCCGGTGG CAGGGACTGT GATGGTGGAA CCCACTGAGA GCGAGAGCTT GGAGGAGCTC AACCGCTTCT GCGACGCCAT GATTGCCATA CGGGAAGAAA CGGCAGCGAT TGAGTCAGGC CAGATTGATC CTCAGAACAA CCCGCTGCGA CGGGCGCCTC ACACCTTGGC GGCCGTAACA GCAGAGGTTT GGGATCGTCC ATATTCCCGC GCCGAAGCTG CTTTCCCATT GGCTGAGCAG CGCCAAAGCA AGTTTTGGCC TGCGGTTTCG CGAATTGATA ATGCCTATGG CGATCGCAAT TTGCTGTGTA GCTGCCCATC GGTTGAGGAG CTTGCTGATA ACTCTGTTCT TAAACCTCCC CTGGTGTGA
|
Protein sequence | MTLLEQRAVE ASSIQSSPFL VRHIGPSFED QQQMLLELGH RDLQSFVAAV VPPDILETTA PTSSLPEGCG EVQAMEELRL IAAANRVRRS LIGLGYYGTA TPALIQRQVL ENPAWYTAYT PYQAEISQGR LEALFNFQTL ISELTGLPIA NASLLDEGTA AAEAMSLSFA ICKRPQAHRF LVDAEVLPQT LAVLRTRAEP LGIHLEVAEP MTFQFDAEVF GVLLQLPGRS GRLWDPTTSI QAAHEVGALA TVAIDPLAQV LIAPVAEFGA DIAVGSVQRF GVPMAFGGPH AAFFATLEIF KRQVPGRLVG QSVDAEGQPA LRLALQTREQ HIRRDKATSN ICTAQVLLAV MASFYAVHHG PDGLAAIARR VLLLRAQLER GLHQLGYPVQ SIARFDTIEV ICREAPAVHQ AAALAGFNLR VLPLGVAPEA AHGFGISFDE LSTDQELKSI LQILAEAAGQ PVPVLEDLGN PHLEELVGLP LRQRPWLQQQ VFHRYRSETE LLRYLQRLVG RDLSLVHGMI PLGSCTMKLN AAAELIPISW REFAALHPFA PQDQCHGYQR LVQDLEHWFA EITGFAGVSL QPNAGSQGEL AGLLVIRAWH HSRGEQQRDV CLIPTSAHGT NPATCVMAGL RVVPVACDAE GNVDLNDLAS KAEAHAPQLA ALMVTYPSTH GVFEPQIREI CELVHGHGGQ VYLDGANLNA QIGFCRPGTY GADVCHINLH KTFCIPHGGG GPGVGPIAVA AHLMPFLPGH PLAACGGEQG IGAISAAPWG SAGILPISWM YLRMMGAEGL RQASAVALLS ANYLAHRLHP HYPVLFRGQA GLVAHECILD LRPLKRSAGL EVDDIAKRLM DYGFHAPTVS WPVAGTVMVE PTESESLEEL NRFCDAMIAI REETAAIESG QIDPQNNPLR RAPHTLAAVT AEVWDRPYSR AEAAFPLAEQ RQSKFWPAVS RIDNAYGDRN LLCSCPSVEE LADNSVLKPP LV
|
| |