Gene P9301_18581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_18581 
SymbolgcvP 
ID4910949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1583281 
End bp1586190 
Gene Length2910 bp 
Protein Length969 aa 
Translation table11 
GC content33% 
IMG OID640161463 
Productglycine dehydrogenase 
Protein accessionYP_001092082 
Protein GI126697196 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID[TIGR00461] glycine dehydrogenase (decarboxylating)
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATCCA AATTTGGGTC TGATTTGTTT ATAGATAGGC ATCTTGGGTT AGGAGATAAT 
GATGAAAGAA TTATGTTGAA CAAGCTTGGT TTTAATAATA TTGATCAATT TATAAATCAA
GTTATTCCCG AAGATATTCA GATTAAAGAC AAATCTTCAG AAATATTGCC CCAAGGTTGT
TCAGAAATTG AAGCTTTAAA TGAATTAGAA GAGATTGCGA ATAAAAATAC CAAAATTAGA
TCACTTATAG GCCTTGGTTA TTATGACAAT CACATGCCTA AAGTAATCCA AAGACATGTT
CTTGAAAATC CAAGGTGGTA CACGTCTTAT ACTCCATATC AAGCAGAAAT TGCACAAGGA
AGATTAGAAG CTCTATTTAA TTTTCAGACT ATTGTTTGTG AACTAACAGG ATTCCCTGTC
GCCAATGCAT CTTTGTTGGA TGAGGGTACT GCTGCTGCAG AAGCTATGGC CATGAGTTTT
TCCGCAAGAA AAAATAAATC TTCAAAAGTG TACTTAGTGG AATCAAATGT TTTTGATCAT
ACTTTTAATG TTCTACAAAC CAGAGCAAAA CCTTTGGGAA TATCCTTAAA ACGCTTTACT
CAAAGCAAAC TTCCTAATCA TGATGATGTT TTTGGAATGT TGTTGCAATT ACCTGGTAAA
AATGGGGAAT TATATGATCC CACATTCTTA ATATCCCAAG CACATAGATC AGAAATTATC
GTTACGGCAT GTATTGATCC ACTAGCACAA GTTTTGATTA AACCAATTTC TGAATTTGGT
GTTGATGTAG CAGTGGGGAG TATGCAAAGA TTTGGTGTTC CAATGGGTTT TGGTGGCCCC
CATGCAGCAT ATTTTGCTTG TAGCGAAAAA TATAAAAGGC TGATACCCGG AAGAATTGTT
GGGCAAACTC TATCTAAAAA TGGAGAAAAG TCTCTAAGAC TAGCATTGCA AACAAGAGAG
CAACATATTA GAAGGGAAAA GGCCACTAGT AATATTTGCA CTGCTCAATC TTTGTTAGCC
ATAATTTCTT CTTTTTATGC TATTTATCAT GGACCCTCTG GATTAACTAA AATAGCTAAG
AGATTAGTTG AGTTGAGAAT AAATTTAGAA TCAAGTTTAG CTGCTTTGGG TTTTGATATT
CCTGATGGGA TTAGATTTGA TAGTGTTGAT GTTTATTCTG AGCACTCCCA TAGGATCCAT
AATGAAGCTT TAAAAAATGG CTATAACTTA AGAATTTTGC CGTTAGGATC AACTATTGAA
AATTCAACTG GCTTTGGGAT CTCTTTAGAT GAGCTTAGTA ATGAAAAAGA AATAAAAGAT
ATTTTGACTT TCATAGCAAA CATTATAGAA AAAGAAGAAG ATTTAGAGCA TATAAAATTT
GATAAAGATT TTCATCTTGA AAGTTTAGCT TTGAGATCCA GTGCATGGAT GCAGCAAGAT
ATATTTACAA ATTACCAAAG TGAAACTGAA TTAATGAGAT ATATATTTCG ACTTGCAGAA
AAAGATTTTT CTTTGGTAGA TGGGATGATG CCATTGGGAA GCTGTACCAT GAAGTTAAAT
TCTGCAGCAG AGTTAAATCC AGTCTCTTGG GCTAATTTAT CTTCCATGCA TCCTTTTTCC
CCACCAGATC AAACTAAAGG CTATTCAAAA ATTATATCTG ACTTAGAAAA ATGGATAAGT
GAGATTGTTG GTTTAAAATC AGTTTCTTTT CAACCAAATG CAGGCTCTCA AGGAGAGTTT
GCAGGTTTAT TGGCAATTAA TTCTTATTTT GAATCAAAAG GTGAACTGTT AAGAAAAAAA
TGTTTAATTC CAAAAAGTGC TCATGGAACA AATCCTGCTA GTGCAGTTAT GGCAGGTTTT
GAAGTGTTAA CTGTTGAATG TGATGACGAA GGAAATATTG ATTATCAAGA TTTGTCACTC
AAGGTCAAGA AATTTGACAA CCAAATAGGG GCTCTTATGT TGACTTATCC CTCTACTCAT
GGAGTTTTTG AATTACAAAT CAGAAAGATA TGTGATTTGA TTCATTCTGT GGGAGGATTT
GTTTATTTAG ATGGAGCAAA TTTGAATGCT CAGGTTGGAT TATGCAAACC TGGAAATTAT
GGCGTTGATG TTTGTCATTT GAATTTACAT AAAACATTCT GCATTCCACA TGGAGGTGGT
GGTCCAGGAG TAGGTCCAGT TGCTGCATCA GAAACTTTAA GCCCATACCT TCCTACTCAT
TTTTTAATGG ATAATAATTT ATCTAATAGT TTTAATTTCG TATCTTCTGC CAAGCATGGG
AGTGCAAGTA TTCTTCCAAT AAGTTGGATG TATATAAAAA TGGCAGGTCT TAGTGGGTTA
AGGAAAGCAA CTTCGCACGC AATTCTATCT GCAAATTATA TTGCGCATTC CTTAAAACAT
AAATTCAAGA TTCTCTATAA AGGAAAAAAT AATTTTGTCG CACATGAATG TATTTTAGAT
TTTAGAGATT TAAAATCCAA AACTGGATTG AGTGTCAATG ATTTAGCTAA ACGATTAATT
GATTATAGTT TTCATGCCCC AACTATTAGT TGGCCTGTTC CAGAAACCAT AATGATAGAG
CCTACTGAAA GTGAAAGTTT GGCAGAATTA GATAGATTTT GTGAGGCTAT GCTATTGATT
GGAGAAGAAA TCAGTGAAAT AGAAAATAAT CATGAATTAA AAAATAATAA TGTAATAAGT
AATGCTCCCC ATACTCTGAA AGAGTTAATT GCTGATAATT GGCAATATCC TTATTCAAAA
GAAAAGGCTT CTTTCCCTTA TAAAACTCCA ACAAATACTA AGTTTTGGTC TTCAGTTTCT
AGAATCAATA ATGCATATGG CGATCGAAAT TTAATTTGTT CTTGTAATGT AAATCAAAAT
GAGACTTTAG AAGAAAAAAA ATGTGCTTAA
 
Protein sequence
MTSKFGSDLF IDRHLGLGDN DERIMLNKLG FNNIDQFINQ VIPEDIQIKD KSSEILPQGC 
SEIEALNELE EIANKNTKIR SLIGLGYYDN HMPKVIQRHV LENPRWYTSY TPYQAEIAQG
RLEALFNFQT IVCELTGFPV ANASLLDEGT AAAEAMAMSF SARKNKSSKV YLVESNVFDH
TFNVLQTRAK PLGISLKRFT QSKLPNHDDV FGMLLQLPGK NGELYDPTFL ISQAHRSEII
VTACIDPLAQ VLIKPISEFG VDVAVGSMQR FGVPMGFGGP HAAYFACSEK YKRLIPGRIV
GQTLSKNGEK SLRLALQTRE QHIRREKATS NICTAQSLLA IISSFYAIYH GPSGLTKIAK
RLVELRINLE SSLAALGFDI PDGIRFDSVD VYSEHSHRIH NEALKNGYNL RILPLGSTIE
NSTGFGISLD ELSNEKEIKD ILTFIANIIE KEEDLEHIKF DKDFHLESLA LRSSAWMQQD
IFTNYQSETE LMRYIFRLAE KDFSLVDGMM PLGSCTMKLN SAAELNPVSW ANLSSMHPFS
PPDQTKGYSK IISDLEKWIS EIVGLKSVSF QPNAGSQGEF AGLLAINSYF ESKGELLRKK
CLIPKSAHGT NPASAVMAGF EVLTVECDDE GNIDYQDLSL KVKKFDNQIG ALMLTYPSTH
GVFELQIRKI CDLIHSVGGF VYLDGANLNA QVGLCKPGNY GVDVCHLNLH KTFCIPHGGG
GPGVGPVAAS ETLSPYLPTH FLMDNNLSNS FNFVSSAKHG SASILPISWM YIKMAGLSGL
RKATSHAILS ANYIAHSLKH KFKILYKGKN NFVAHECILD FRDLKSKTGL SVNDLAKRLI
DYSFHAPTIS WPVPETIMIE PTESESLAEL DRFCEAMLLI GEEISEIENN HELKNNNVIS
NAPHTLKELI ADNWQYPYSK EKASFPYKTP TNTKFWSSVS RINNAYGDRN LICSCNVNQN
ETLEEKKCA