Gene P9303_28901 details


Gene Information

Locus tag: P9303_28901
Symbol: gcvP
ID: 4778971
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2553582
End bp: 2556530
Gene Length: 2949 bp
Protein Length: 982 aa
Translation table: 11
GC content: 56%
IMG OID: 640088413
Product: glycine dehydrogenase
Protein accession: YP_001018885
Protein GI: 124024578
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
        [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain
TIGRFAM ID: [TIGR00461] glycine dehydrogenase (decarboxylating)
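
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def span_length(start_bp, end_bp):
    """Length of a feature with 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_residues(gene_length_bp):
    """Amino acids encoded by a CDS whose length includes one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Gene Information table.
print(span_length(2553582, 2556530))  # 2949, the listed gene length in bp
print(coding_residues(2949))          # 982, the listed protein length in aa
```

Under these assumptions the listed start, end, gene length, and protein length agree with one another.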


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.624172
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGACCCTGC TTGAACAGCG TGCAGTTGAG GCTTCAAGCA TTCAGTCTTC ACCTTTTTTG 
GTGAGGCACA TTGGGCCCAG CTTTGAAGAC CAGCAGCAAA TGCTGCTGGA GCTTGGTCAT
CGTGATCTTC AGAGCTTTGT TGCTGCTGTA GTTCCTCCAG ACATTCTCGA GACAACGGCA
CCCACTAGTT CCCTTCCTGA GGGCTGTGGT GAAGTTCAGG CTATGGAGGA ACTGCGTCTC
ATCGCGGCGG CGAATCGGGT GAGGCGTTCT TTAATTGGGC TTGGCTATTA CGGCACAGCA
ACGCCTGCGT TGATTCAGCG ACAGGTGCTT GAAAATCCTG CTTGGTATAC CGCTTACACC
CCTTATCAGG CTGAGATCTC TCAAGGACGG CTTGAGGCTC TGTTCAACTT TCAGACGCTG
ATCAGTGAGC TCACGGGGTT GCCGATCGCC AATGCTTCCT TGCTTGATGA GGGAACGGCA
GCTGCTGAGG CGATGAGCCT CAGCTTTGCG ATTTGCAAGC GCCCTCAGGC TCATCGTTTT
CTCGTGGATG CGGAGGTTCT TCCCCAGACT CTTGCTGTGT TGAGAACTCG GGCTGAACCT
CTGGGCATTC ATCTCGAGGT GGCCGAGCCG ATGACCTTTC AGTTTGATGC TGAGGTCTTT
GGTGTTTTGC TTCAGTTGCC TGGACGTAGC GGCAGGCTCT GGGATCCCAC GACCTCCATT
CAGGCTGCTC ATGAGGTAGG GGCCCTGGCA ACTGTTGCCA TTGATCCTCT TGCTCAGGTT
TTGATTGCTC CAGTGGCTGA ATTTGGAGCT GATATTGCTG TTGGCAGTGT TCAACGGTTT
GGGGTGCCAA TGGCCTTTGG GGGGCCTCAC GCCGCCTTTT TTGCCACCCT TGAGATCTTC
AAGCGTCAGG TCCCTGGACG TCTAGTCGGC CAGTCTGTGG ATGCTGAGGG GCAACCGGCA
TTGCGATTGG CGTTGCAGAC CCGGGAGCAA CACATCCGCC GAGACAAGGC CACCAGCAAT
ATCTGCACGG CGCAGGTCTT ATTGGCTGTG ATGGCATCGT TCTATGCAGT TCACCATGGA
CCCGATGGGC TCGCTGCAAT CGCTAGACGA GTCCTGCTTT TACGGGCTCA ACTGGAGAGG
GGCTTGCACC AGTTGGGCTA TCCAGTCCAG TCGATAGCTC GCTTTGACAC GATTGAGGTG
ATCTGTCGCG AAGCTCCTGC TGTTCATCAG GCAGCCGCTT TGGCGGGATT CAACCTGCGG
GTGTTGCCTT TGGGTGTTGC TCCAGAGGCA GCCCATGGAT TTGGAATTAG TTTTGATGAG
CTCAGCACCG ATCAGGAGCT GAAGTCGATC CTCCAGATTT TGGCTGAGGC TGCAGGTCAA
CCTGTGCCAG TTCTCGAAGA CCTTGGGAAC CCTCACCTTG AGGAGCTTGT TGGCTTGCCG
TTGCGCCAGC GACCTTGGTT GCAGCAGCAG GTGTTCCATC GCTATCGCAG TGAAACCGAG
TTGCTTCGTT ACCTCCAGCG ATTGGTGGGC CGCGATCTTT CATTGGTGCA CGGGATGATC
CCCCTGGGCA GTTGCACGAT GAAGCTCAAT GCTGCAGCTG AACTGATCCC CATTAGTTGG
AGGGAGTTTG CGGCGCTGCA TCCTTTTGCT CCTCAGGATC AGTGTCATGG GTATCAGAGA
TTGGTTCAGG ACCTGGAACA TTGGTTTGCA GAAATTACTG GCTTTGCCGG GGTGTCTTTG
CAGCCGAATG CCGGCTCTCA GGGGGAATTA GCTGGCTTGT TGGTGATTCG CGCTTGGCAT
CATTCCCGCG GAGAGCAGCA GAGAGATGTT TGTCTGATTC CCACCAGTGC CCATGGCACC
AATCCAGCCA CGTGCGTGAT GGCCGGTTTG CGGGTGGTGC CTGTGGCTTG TGATGCAGAG
GGGAATGTGG ATCTCAACGA TTTGGCTAGC AAGGCCGAGG CCCATGCTCC TCAGCTGGCA
GCCCTGATGG TCACCTATCC CTCAACCCAT GGTGTCTTTG AACCACAGAT TCGTGAAATC
TGTGAGCTCG TTCATGGTCA CGGTGGCCAG GTGTATCTCG ATGGCGCCAA TCTCAATGCG
CAGATTGGCT TTTGTCGTCC TGGGACTTAT GGCGCTGATG TTTGTCATAT CAATCTGCAC
AAGACCTTTT GTATCCCCCA TGGTGGTGGC GGACCAGGGG TGGGTCCGAT CGCAGTGGCT
GCTCATTTGA TGCCTTTCCT TCCAGGTCAT CCCCTAGCTG CCTGCGGCGG CGAGCAGGGA
ATCGGCGCAA TTTCGGCTGC GCCATGGGGA AGTGCAGGCA TCCTGCCGAT CAGCTGGATG
TACCTGCGCA TGATGGGTGC GGAGGGATTG CGGCAGGCCA GTGCTGTTGC ACTGTTATCG
GCTAATTACC TAGCCCATCG GTTGCATCCT CATTACCCAG TGCTGTTTCG GGGTCAAGCA
GGGTTGGTTG CCCATGAATG CATCCTCGAC TTGCGCCCCC TTAAGCGAAG TGCAGGCTTG
GAGGTGGATG ACATTGCAAA GCGGTTGATG GACTATGGCT TTCATGCCCC AACCGTGAGC
TGGCCGGTGG CAGGGACTGT GATGGTGGAA CCCACTGAGA GCGAGAGCTT GGAGGAGCTC
AACCGCTTCT GCGACGCCAT GATTGCCATA CGGGAAGAAA CGGCAGCGAT TGAGTCAGGC
CAGATTGATC CTCAGAACAA CCCGCTGCGA CGGGCGCCTC ACACCTTGGC GGCCGTAACA
GCAGAGGTTT GGGATCGTCC ATATTCCCGC GCCGAAGCTG CTTTCCCATT GGCTGAGCAG
CGCCAAAGCA AGTTTTGGCC TGCGGTTTCG CGAATTGATA ATGCCTATGG CGATCGCAAT
TTGCTGTGTA GCTGCCCATC GGTTGAGGAG CTTGCTGATA ACTCTGTTCT TAAACCTCCC
CTGGTGTGA
 
Protein sequence
MTLLEQRAVE ASSIQSSPFL VRHIGPSFED QQQMLLELGH RDLQSFVAAV VPPDILETTA 
PTSSLPEGCG EVQAMEELRL IAAANRVRRS LIGLGYYGTA TPALIQRQVL ENPAWYTAYT
PYQAEISQGR LEALFNFQTL ISELTGLPIA NASLLDEGTA AAEAMSLSFA ICKRPQAHRF
LVDAEVLPQT LAVLRTRAEP LGIHLEVAEP MTFQFDAEVF GVLLQLPGRS GRLWDPTTSI
QAAHEVGALA TVAIDPLAQV LIAPVAEFGA DIAVGSVQRF GVPMAFGGPH AAFFATLEIF
KRQVPGRLVG QSVDAEGQPA LRLALQTREQ HIRRDKATSN ICTAQVLLAV MASFYAVHHG
PDGLAAIARR VLLLRAQLER GLHQLGYPVQ SIARFDTIEV ICREAPAVHQ AAALAGFNLR
VLPLGVAPEA AHGFGISFDE LSTDQELKSI LQILAEAAGQ PVPVLEDLGN PHLEELVGLP
LRQRPWLQQQ VFHRYRSETE LLRYLQRLVG RDLSLVHGMI PLGSCTMKLN AAAELIPISW
REFAALHPFA PQDQCHGYQR LVQDLEHWFA EITGFAGVSL QPNAGSQGEL AGLLVIRAWH
HSRGEQQRDV CLIPTSAHGT NPATCVMAGL RVVPVACDAE GNVDLNDLAS KAEAHAPQLA
ALMVTYPSTH GVFEPQIREI CELVHGHGGQ VYLDGANLNA QIGFCRPGTY GADVCHINLH
KTFCIPHGGG GPGVGPIAVA AHLMPFLPGH PLAACGGEQG IGAISAAPWG SAGILPISWM
YLRMMGAEGL RQASAVALLS ANYLAHRLHP HYPVLFRGQA GLVAHECILD LRPLKRSAGL
EVDDIAKRLM DYGFHAPTVS WPVAGTVMVE PTESESLEEL NRFCDAMIAI REETAAIESG
QIDPQNNPLR RAPHTLAAVT AEVWDRPYSR AEAAFPLAEQ RQSKFWPAVS RIDNAYGDRN
LLCSCPSVEE LADNSVLKPP LV
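
The gene sequence begins with TTG, yet the protein begins with M. That is consistent with translation table 11, where TTG is one of the alternative initiation codons and an initiator codon is read as methionine. The following is a minimal sketch of that translation plus a GC-content helper. It is not the database's own pipeline. The codon table is built from the standard NCBI ordering (bases T, C, A, G), the start-codon set is the one NCBI lists for table 11, and the function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a CDS, reading a valid first codon as M and stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 18 bases of the gene sequence above.
print(translate("TTGACCCTGC TTGAACAG"))  # MTLLEQ, the first six residues of the protein
```

Passing the full gene sequence to `translate` and `gc_percent` would allow comparison against the listed protein sequence and the listed GC content.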