Gene A9601_18771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_18771 
SymbolgcvP 
ID4718615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1611287 
End bp1614196 
Gene Length2910 bp 
Protein Length969 aa 
Translation table11 
GC content34% 
IMG OID640079611 
Productglycine dehydrogenase 
Protein accessionYP_001010267 
Protein GI123969409 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID[TIGR00461] glycine dehydrogenase (decarboxylating)
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATCTA AATTTGGGTC TGATTTGTTT GTAGATAGGC ATCTTGGGTT AGGAGATAAT 
GACGAAAGAA TTATGCTGAA CAAGCTTGGT TTTAATAATA TTGATCAATT TATAAATCAA
GTTATTCCTG AAGATATTCA GCTTAAAGAT AAATCTTCAG AAATATTGCC CCAAGGTTGT
TCAGAAATTG AGGCTTTAAA CGAATTAGAA GAGATTGCGA ATAAAAATAC TAAAATGAGA
TCACTTATAG GTCTTGGTTA TTATGACAAT CATATGCCTA AAGTAATCCA AAGACATGTT
CTTGAAAATC CAAGGTGGTA CACGTCTTAT ACTCCATATC AAGCAGAAAT TGCACAAGGA
AGATTAGAAG CTCTATTTAA TTTTCAGACT ATTGTTTGTG AACTAACAGG ATTCCCTGTC
GCCAATGCAT CTTTGTTGGA TGAGGGTACT GCTGCAGCAG AATCCATGGC CATGAGTTTT
TCGGCAAGAA AAAATAAATC TTCAAAAGTT TACTTAGTGG AGTCAAATGT TTTTGATCAT
ACTTTTAATG TTCTACAAAC CAGAGCAAAA CCTTTGGGAA TATCTTTAAA ACGCTTTACT
CAAAGCAACC TTCCTAATCA TGATGATGTT TTTGGAATGC TGTTGCAATT ACCTGGTAAA
AATGGTCAAT TATATGATCC CACATTCTTA ATATCCCAAG CACATCGATC AGAAATTATT
GTTACGGCAT GTATTGATCC ACTAGCACAA GTTTTGATTA AACCAATTTC TGAATTTGGT
GTTGATGTAG CAGTGGGTAG TATGCAAAGA TTTGGTGTAC CAATGGGTTT TGGCGGTCCC
CATGCAGCAT ATTTTGCTTG TAGCGAGCAA TATAAAAGGC TGATACCCGG AAGAATTGTT
GGGCAAACTC TCTCTAAAAA TGGAGAAAAA TCTCTAAGAC TAGCATTGCA AACAAGAGAG
CAACATATTA GAAGGGAAAA GGCCACCAGT AATATTTGTA CTGCTCAATC TTTATTAGCC
ATAATTTCTT CTTTTTATGC TATTTATCAT GGACCCTCTG GATTAACGCA AATTGCTAAG
AGATTAGTTG AGTTGAGAAT AAATTTAGAA TCAAGTTTAG CTGCTTTAGG TTTTGATATT
CCTGATGGGA TAAGATTTGA TAGTGTTGAT GTTTATTCTG AGCACTCCCA GAGGATCCAT
AATGAAGCTT TAAAAAATGG CTACAACTTA AGAATTTTGC CGTTGGGATC AACTATTGAA
AATTCAACTG GCTTTGGGAT CTCTTTAGAT GAGCTTAGTA ATGAAAAAGA AATAAAAGAT
ATTTTGACTT TCATAGCAAA CCTTTTAGAA AAAGAAGATG ATTTAGAGCA TATAAAATTT
GATAAAGAAT TTCATCTTGA AAGTTTAGCT TTGAGATCCA GTGAATGGAT GCAGCAAGAT
ATATTCACAA ATTATCAAAG TGAAACTGAA TTAATGAGAT ATATATTCCG ACTTGCAGAA
AAAGATTTTT CTTTGGTAGA TGGGATGATG CCATTGGGAA GCTGTACCAT GAAGTTAAAT
TCTGCAGCAG AGTTAAATCC AGTCTCTTGG GCTAATTTAT CTTCCATGCA TCCTTTTTCC
CCACTAGATC AAACTAAAGG CTATTCAAAA ATTATATCTG ACCTAGAAAA ATGGATAAGT
GAGATTGTTG GTTTAAAATC AGTTTCTTTT CAACCAAATG CAGGCTCTCA AGGAGAGTTT
GCAGGTTTAT TGGCAATTAA TTCTTATTTT GAATCAAGAG GTGAACTGTT AAGAAAAAAA
TGTTTAATTC CAAAAAGTGC TCATGGAACA AATCCTGCTA GTGCAGTTAT GGCAGGTTTT
GACGTGTTAA CTGTTGAATG TGATGACGAA GGAAATATTG ATTATCAAGA TTTGTCAATT
AAGGTCAAGA AATTTGATAA CCAAATAGGG GCTCTTATGT TGACTTATCC CTCTACTCAT
GGAGTTTTTG AATTACAAAT CAGAAAGATA TGTGATTTAA TTCATTCTGT GGGAGGATTT
GTCTATTTAG ATGGAGCAAA TTTGAACGCT CAGGTTGGAT TATGCAAACC TGGAAACTAT
GGGGTTGATG TTTGTCATTT GAACTTACAT AAAACATTCT GCATTCCGCA TGGAGGTGGT
GGTCCAGGAG TAGGTCCAGT TGCTGCATCA GAAACTTTAA GCCCGTACCT TCCTACTCAT
TCTTTAATGG ATAATAATTT ATCTAATAGT TTAAATTTCG TATCTTCTGC CAAGCATGGG
AGTGCAAGTA TCCTTCCAAT AAGTTGGATG TATATAAAAA TGGCAGGTCT TAGTGGGTTA
AGGAAAGCAA CTTCGCACGC AATTTTATCT GCAAATTATA TTGCGCATTC CTTAAAACAT
AAATTCAAGA TTCTTTATAA AGGAAAAAAT AATTTTGTCG CACATGAATG TATTTTAGAT
TTTAGAGATT TAAAATCCAA AACTGGATTG AGTGTCAATG ATTTAGCTAA ACGATTAATT
GATTATAGTT TTCATGCCCC AACTATTAGT TGGCCTGTGC CAGAAACCAT AATGATAGAG
CCTACTGAAA GTGAAAGTTT GGCAGAAGTA GATAGATTTT GTGAGGCTAT GCTATTGATT
GGAGAAGAAA TCAGTGAAAT AGAAAATAAT TATGAATTAA AAAATAATAA TGTAATAAGC
AATTCTCCCC ATACGTTGAA AGAGTTAATT GCTGATAATT GGCAATATCC TTATTCAAAA
GAAAAGGCTT CTTTCCCTTA TAAAACTCCA ACAACTATTA AGTTTTGGTC TTCAGTTTCT
AGGATCAATA ATGCATATGG CGATCGCAAT TTAATTTGTT CTTGCAATGT AAATCAAGAT
GAGACTTTAG AAGAAAAAAA ATGTGCTTAA
 
Protein sequence
MTSKFGSDLF VDRHLGLGDN DERIMLNKLG FNNIDQFINQ VIPEDIQLKD KSSEILPQGC 
SEIEALNELE EIANKNTKMR SLIGLGYYDN HMPKVIQRHV LENPRWYTSY TPYQAEIAQG
RLEALFNFQT IVCELTGFPV ANASLLDEGT AAAESMAMSF SARKNKSSKV YLVESNVFDH
TFNVLQTRAK PLGISLKRFT QSNLPNHDDV FGMLLQLPGK NGQLYDPTFL ISQAHRSEII
VTACIDPLAQ VLIKPISEFG VDVAVGSMQR FGVPMGFGGP HAAYFACSEQ YKRLIPGRIV
GQTLSKNGEK SLRLALQTRE QHIRREKATS NICTAQSLLA IISSFYAIYH GPSGLTQIAK
RLVELRINLE SSLAALGFDI PDGIRFDSVD VYSEHSQRIH NEALKNGYNL RILPLGSTIE
NSTGFGISLD ELSNEKEIKD ILTFIANLLE KEDDLEHIKF DKEFHLESLA LRSSEWMQQD
IFTNYQSETE LMRYIFRLAE KDFSLVDGMM PLGSCTMKLN SAAELNPVSW ANLSSMHPFS
PLDQTKGYSK IISDLEKWIS EIVGLKSVSF QPNAGSQGEF AGLLAINSYF ESRGELLRKK
CLIPKSAHGT NPASAVMAGF DVLTVECDDE GNIDYQDLSI KVKKFDNQIG ALMLTYPSTH
GVFELQIRKI CDLIHSVGGF VYLDGANLNA QVGLCKPGNY GVDVCHLNLH KTFCIPHGGG
GPGVGPVAAS ETLSPYLPTH SLMDNNLSNS LNFVSSAKHG SASILPISWM YIKMAGLSGL
RKATSHAILS ANYIAHSLKH KFKILYKGKN NFVAHECILD FRDLKSKTGL SVNDLAKRLI
DYSFHAPTIS WPVPETIMIE PTESESLAEV DRFCEAMLLI GEEISEIENN YELKNNNVIS
NSPHTLKELI ADNWQYPYSK EKASFPYKTP TTIKFWSSVS RINNAYGDRN LICSCNVNQD
ETLEEKKCA