Gene NATL1_21381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_21381 
SymbolgcvP 
ID4780859 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp1794270 
End bp1797176 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content36% 
IMG OID640085435 
Productglycine dehydrogenase 
Protein accessionYP_001015958 
Protein GI124026843 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID[TIGR00461] glycine dehydrogenase (decarboxylating)
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.102572 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAAAG CAGAATTAAA GGATTTCACT TTTAAGTCTC GCCATATAGG TCCAACTAAC 
GAAGATGAAG CTTTAATGCT TCAACATCTT GGTTATGAGA ATTCGGAAGA ATTTATATCT
TCGGTCATTC CCAATGAAAT ATTTGATTCA GAAAACAATG TTGTTTCTAT CCCAGATGGG
TGTGATCAAA ATAAAGCGTT AAAAGAAATC AATATAATTT CAAAGAAAAA TGTTGAACAT
CGTTCGTTGA TTGGTCTGGG TTATCATTCG ACTGTTATAC CTCCAGTTAT ACAAAGAAAT
GTTCTAGAAA ATCCTAATTG GTATACAGCT TATACTCCTT ATCAAGCAGA AATATCTCAA
GGCAGATTAG AAGCTTTATT TAATTTTCAG ACGTTAATAA GTGAATTAAC AGGCTTACCC
ATATCTAATG CGTCTTTACT TGATGAAGCC ACTGCGGCAG CTGAGGCAAT TAGTTTGAGT
TTGGCTGTTA GAAAAAATAA AAATGCTAAT AAGTTTTTAG TAGATCAAGA AATTTTGCCT
CAAACATTTG ATGTATTAAA AACTCGATGC GAACCGCTAG GCATTTCCCT TGAGATGTTT
GAAAACAATA ATTTTGAAAT CGATAAAAAT ATTTTTGGAA TTCTTATTCA GTTACCTGGG
AAAAATGGTC GTATTTGGGA CCCAACCAAA ATAATTAATG ACGCACATAA ATGTAATGCA
ATAGTGACCA TTGCAATTGA TCCTTTAGCT CAAGTGTTGA TTAAACCTAT GGGCGAATTT
GGTGCAGATA TTGTAGTTGG AAGTGCACAA AGATTTGGAG TTCCAATTGC TTGTGGTGGA
CCACATGCCG CTTTTTTTGC AACTAAAGAA ATATATAAAA GACAAATACC AGGAAGGATA
GTTGGGCAAT CAGTTGACGT AGAAGGAAAT CAAGCATTGA GGCTTGCTCT TCAAACTAGA
GAGCAACATA TTCGAAGAGA TAAAGCAACA AGTAATATAT GCACAGCTCA AGTTTTACTA
GCTGTTCTTT CTTCTTTTTA TGCAGTACAT CACGGCCCAA AAGGTCTTAA GCAAATAGCA
GAAAATGTGG TCAAATATAG ATCAAATTTT GAATCTATAT TAATGAATTT AGAATACCCT
ATAGAAAAAT ATTCAGCATT TGATAGTGTT GATGTTTATT GTTCAGAAGC ATCGGAAGTT
ATTCAGTTAG CGTCTGAAGA AGGATATAAC TTTAGAGTTC TTCCTATCGG ATCAGATTTT
GAAAATGCTA AAGGTTTTGG TGTGACTTTT GATGAATTAA CATGTGATGA AGAAATTTAT
ACATTGCATC AAATACTTGC ACAAGTTAAA GGAAAAAAAG CTCATGATCT TTCCAATTTT
CTTAATGAGA ATGCATCACT AGTTGATATT CCACTCCGAG AAAAATCTTG GCTTGAACAA
TCGGTATTTA ATCAATATCA AAGTGAGACT GATTTAATGA GATATATACA TAGTTTAGTA
TCGAAAGATT TTTCTTTAGT TCAAGGAATG ATTCCTCTTG GAAGTTGCAC AATGAAATTG
AATTCAGCTG CGGAACTTTT ACCCATTGAA TGGAGAGAGT TTTCCTCTAT TCATCCTTTT
GCTCCTCATG CTCAATTAGC TGGATTCCAC GAAATAATTA ATGACCTTGA AAATTGGCTG
TCTGCTTTAA CAGGCTTTCA GGGAGTTTCT CTTCAGCCAA ATGCAGGTTC TCAAGGAGAA
TTTGCTGGTT TGCTTGTAAT ACGTTCATGG CATCAGTCTC TGGGAGAAGG CCATAGGAAT
ATTTGCTTGA TTCCAACAAG TGCTCACGGT ACTAATCCGG CTAGTGCGGT AATGTCTGGC
TTTAAAGTTG TTTCTGTTAA ATGTGATGAA TATGGCAATG TTGATTTAGA AGACTTAAAA
AATAAATCAA AAATTCATTC AAAGAATTTG GCTGCATTGA TGGTTACTTA CCCTTCAACT
CATGGAGTAT TTGAGCCAAA TATCCGTGAG ATGTGCCAAG TAATTCATCA AGAGGGTGGT
CAGGTCTATT TGGATGGAGC AAATTTGAAT GCTCAAGTAG GTATTTGCAG GCCTGGATCT
TATGGCATAG ATGTATGCCA TTTAAATCTT CATAAAACTT TTTCCATTCC TCATGGTGGA
GGAGGCCCCG GAGTAGGTCC TATAGCTGTA GCAGATCATT TGGTTCCTTA TCTGCCTGGA
CATTCAATTA TCAAGTGCGG AGGCGAAAAG GCTATTTCGG CAGTATCTGC AGCTCCATTT
GGTAGCGCTG GAATATTGCC CATCAGTTGG ATGTACATCC GAATGATGGG TAGCGATGGG
TTACGCAAAG CGAGTTCTAT AGCAATTCTC TCTGCTAATT ATTTAGCAAA AAGACTTGAT
CCTTATTATC CAGTTTTATT TAAAGATCCC AATGGGCTAG TAGCTCATGA ATGCATATTA
GATTTACGAC CATTAAAGAG TCAGTTAGGC ATAGAAGTAG AGGATGTTGC CAAGAGATTG
ATGGATTATG GATTTCATGC ACCAACAATA AGTTGGCCTG TTGCTGGAAC TTTGATGGTT
GAACCAACAG AGAGTGAAAG TTTGCCTGAA TTAGATCGTT TTTGTGACGC GATGATAGGA
ATACGCGAAG AAATTGAACA AATAAAATTA GGAAAGATTG ATCCAATAAA TAATCCTTTA
AAACAATCTC CACATACCTT AAAAAGAGTT ACCTCGGATG ATTGGGATAG ACCTTACTCT
CGTAAAGAAG CTGCTTATCC TTTGCCTGAT CAAGAAAAAT ATAAATTTTG GCCATCAGTT
TCACGTATTA ATAATGCTTA TGGAGACAGG AATTTGATAT GTAGTTGTCC TTCAGTTCAA
GATTTAGAAG ATATTAATTC TGTTTGA
 
Protein sequence
MSKAELKDFT FKSRHIGPTN EDEALMLQHL GYENSEEFIS SVIPNEIFDS ENNVVSIPDG 
CDQNKALKEI NIISKKNVEH RSLIGLGYHS TVIPPVIQRN VLENPNWYTA YTPYQAEISQ
GRLEALFNFQ TLISELTGLP ISNASLLDEA TAAAEAISLS LAVRKNKNAN KFLVDQEILP
QTFDVLKTRC EPLGISLEMF ENNNFEIDKN IFGILIQLPG KNGRIWDPTK IINDAHKCNA
IVTIAIDPLA QVLIKPMGEF GADIVVGSAQ RFGVPIACGG PHAAFFATKE IYKRQIPGRI
VGQSVDVEGN QALRLALQTR EQHIRRDKAT SNICTAQVLL AVLSSFYAVH HGPKGLKQIA
ENVVKYRSNF ESILMNLEYP IEKYSAFDSV DVYCSEASEV IQLASEEGYN FRVLPIGSDF
ENAKGFGVTF DELTCDEEIY TLHQILAQVK GKKAHDLSNF LNENASLVDI PLREKSWLEQ
SVFNQYQSET DLMRYIHSLV SKDFSLVQGM IPLGSCTMKL NSAAELLPIE WREFSSIHPF
APHAQLAGFH EIINDLENWL SALTGFQGVS LQPNAGSQGE FAGLLVIRSW HQSLGEGHRN
ICLIPTSAHG TNPASAVMSG FKVVSVKCDE YGNVDLEDLK NKSKIHSKNL AALMVTYPST
HGVFEPNIRE MCQVIHQEGG QVYLDGANLN AQVGICRPGS YGIDVCHLNL HKTFSIPHGG
GGPGVGPIAV ADHLVPYLPG HSIIKCGGEK AISAVSAAPF GSAGILPISW MYIRMMGSDG
LRKASSIAIL SANYLAKRLD PYYPVLFKDP NGLVAHECIL DLRPLKSQLG IEVEDVAKRL
MDYGFHAPTI SWPVAGTLMV EPTESESLPE LDRFCDAMIG IREEIEQIKL GKIDPINNPL
KQSPHTLKRV TSDDWDRPYS RKEAAYPLPD QEKYKFWPSV SRINNAYGDR NLICSCPSVQ
DLEDINSV