Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_21381 |
Symbol | gcvP |
ID | 4780859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | - |
Start bp | 1794270 |
End bp | 1797176 |
Gene Length | 2907 bp |
Protein Length | 968 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640085435 |
Product | glycine dehydrogenase |
Protein accession | YP_001015958 |
Protein GI | 124026843 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | [TIGR00461] glycine dehydrogenase (decarboxylating) [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.102572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAAAG CAGAATTAAA GGATTTCACT TTTAAGTCTC GCCATATAGG TCCAACTAAC GAAGATGAAG CTTTAATGCT TCAACATCTT GGTTATGAGA ATTCGGAAGA ATTTATATCT TCGGTCATTC CCAATGAAAT ATTTGATTCA GAAAACAATG TTGTTTCTAT CCCAGATGGG TGTGATCAAA ATAAAGCGTT AAAAGAAATC AATATAATTT CAAAGAAAAA TGTTGAACAT CGTTCGTTGA TTGGTCTGGG TTATCATTCG ACTGTTATAC CTCCAGTTAT ACAAAGAAAT GTTCTAGAAA ATCCTAATTG GTATACAGCT TATACTCCTT ATCAAGCAGA AATATCTCAA GGCAGATTAG AAGCTTTATT TAATTTTCAG ACGTTAATAA GTGAATTAAC AGGCTTACCC ATATCTAATG CGTCTTTACT TGATGAAGCC ACTGCGGCAG CTGAGGCAAT TAGTTTGAGT TTGGCTGTTA GAAAAAATAA AAATGCTAAT AAGTTTTTAG TAGATCAAGA AATTTTGCCT CAAACATTTG ATGTATTAAA AACTCGATGC GAACCGCTAG GCATTTCCCT TGAGATGTTT GAAAACAATA ATTTTGAAAT CGATAAAAAT ATTTTTGGAA TTCTTATTCA GTTACCTGGG AAAAATGGTC GTATTTGGGA CCCAACCAAA ATAATTAATG ACGCACATAA ATGTAATGCA ATAGTGACCA TTGCAATTGA TCCTTTAGCT CAAGTGTTGA TTAAACCTAT GGGCGAATTT GGTGCAGATA TTGTAGTTGG AAGTGCACAA AGATTTGGAG TTCCAATTGC TTGTGGTGGA CCACATGCCG CTTTTTTTGC AACTAAAGAA ATATATAAAA GACAAATACC AGGAAGGATA GTTGGGCAAT CAGTTGACGT AGAAGGAAAT CAAGCATTGA GGCTTGCTCT TCAAACTAGA GAGCAACATA TTCGAAGAGA TAAAGCAACA AGTAATATAT GCACAGCTCA AGTTTTACTA GCTGTTCTTT CTTCTTTTTA TGCAGTACAT CACGGCCCAA AAGGTCTTAA GCAAATAGCA GAAAATGTGG TCAAATATAG ATCAAATTTT GAATCTATAT TAATGAATTT AGAATACCCT ATAGAAAAAT ATTCAGCATT TGATAGTGTT GATGTTTATT GTTCAGAAGC ATCGGAAGTT ATTCAGTTAG CGTCTGAAGA AGGATATAAC TTTAGAGTTC TTCCTATCGG ATCAGATTTT GAAAATGCTA AAGGTTTTGG TGTGACTTTT GATGAATTAA CATGTGATGA AGAAATTTAT ACATTGCATC AAATACTTGC ACAAGTTAAA GGAAAAAAAG CTCATGATCT TTCCAATTTT CTTAATGAGA ATGCATCACT AGTTGATATT CCACTCCGAG AAAAATCTTG GCTTGAACAA TCGGTATTTA ATCAATATCA AAGTGAGACT GATTTAATGA GATATATACA TAGTTTAGTA TCGAAAGATT TTTCTTTAGT TCAAGGAATG ATTCCTCTTG GAAGTTGCAC AATGAAATTG AATTCAGCTG CGGAACTTTT ACCCATTGAA TGGAGAGAGT TTTCCTCTAT TCATCCTTTT GCTCCTCATG CTCAATTAGC TGGATTCCAC GAAATAATTA ATGACCTTGA AAATTGGCTG TCTGCTTTAA CAGGCTTTCA GGGAGTTTCT CTTCAGCCAA ATGCAGGTTC TCAAGGAGAA TTTGCTGGTT TGCTTGTAAT ACGTTCATGG CATCAGTCTC TGGGAGAAGG CCATAGGAAT ATTTGCTTGA TTCCAACAAG TGCTCACGGT ACTAATCCGG CTAGTGCGGT AATGTCTGGC TTTAAAGTTG TTTCTGTTAA ATGTGATGAA TATGGCAATG TTGATTTAGA AGACTTAAAA AATAAATCAA AAATTCATTC AAAGAATTTG GCTGCATTGA TGGTTACTTA CCCTTCAACT CATGGAGTAT TTGAGCCAAA TATCCGTGAG ATGTGCCAAG TAATTCATCA AGAGGGTGGT CAGGTCTATT TGGATGGAGC AAATTTGAAT GCTCAAGTAG GTATTTGCAG GCCTGGATCT TATGGCATAG ATGTATGCCA TTTAAATCTT CATAAAACTT TTTCCATTCC TCATGGTGGA GGAGGCCCCG GAGTAGGTCC TATAGCTGTA GCAGATCATT TGGTTCCTTA TCTGCCTGGA CATTCAATTA TCAAGTGCGG AGGCGAAAAG GCTATTTCGG CAGTATCTGC AGCTCCATTT GGTAGCGCTG GAATATTGCC CATCAGTTGG ATGTACATCC GAATGATGGG TAGCGATGGG TTACGCAAAG CGAGTTCTAT AGCAATTCTC TCTGCTAATT ATTTAGCAAA AAGACTTGAT CCTTATTATC CAGTTTTATT TAAAGATCCC AATGGGCTAG TAGCTCATGA ATGCATATTA GATTTACGAC CATTAAAGAG TCAGTTAGGC ATAGAAGTAG AGGATGTTGC CAAGAGATTG ATGGATTATG GATTTCATGC ACCAACAATA AGTTGGCCTG TTGCTGGAAC TTTGATGGTT GAACCAACAG AGAGTGAAAG TTTGCCTGAA TTAGATCGTT TTTGTGACGC GATGATAGGA ATACGCGAAG AAATTGAACA AATAAAATTA GGAAAGATTG ATCCAATAAA TAATCCTTTA AAACAATCTC CACATACCTT AAAAAGAGTT ACCTCGGATG ATTGGGATAG ACCTTACTCT CGTAAAGAAG CTGCTTATCC TTTGCCTGAT CAAGAAAAAT ATAAATTTTG GCCATCAGTT TCACGTATTA ATAATGCTTA TGGAGACAGG AATTTGATAT GTAGTTGTCC TTCAGTTCAA GATTTAGAAG ATATTAATTC TGTTTGA
|
Protein sequence | MSKAELKDFT FKSRHIGPTN EDEALMLQHL GYENSEEFIS SVIPNEIFDS ENNVVSIPDG CDQNKALKEI NIISKKNVEH RSLIGLGYHS TVIPPVIQRN VLENPNWYTA YTPYQAEISQ GRLEALFNFQ TLISELTGLP ISNASLLDEA TAAAEAISLS LAVRKNKNAN KFLVDQEILP QTFDVLKTRC EPLGISLEMF ENNNFEIDKN IFGILIQLPG KNGRIWDPTK IINDAHKCNA IVTIAIDPLA QVLIKPMGEF GADIVVGSAQ RFGVPIACGG PHAAFFATKE IYKRQIPGRI VGQSVDVEGN QALRLALQTR EQHIRRDKAT SNICTAQVLL AVLSSFYAVH HGPKGLKQIA ENVVKYRSNF ESILMNLEYP IEKYSAFDSV DVYCSEASEV IQLASEEGYN FRVLPIGSDF ENAKGFGVTF DELTCDEEIY TLHQILAQVK GKKAHDLSNF LNENASLVDI PLREKSWLEQ SVFNQYQSET DLMRYIHSLV SKDFSLVQGM IPLGSCTMKL NSAAELLPIE WREFSSIHPF APHAQLAGFH EIINDLENWL SALTGFQGVS LQPNAGSQGE FAGLLVIRSW HQSLGEGHRN ICLIPTSAHG TNPASAVMSG FKVVSVKCDE YGNVDLEDLK NKSKIHSKNL AALMVTYPST HGVFEPNIRE MCQVIHQEGG QVYLDGANLN AQVGICRPGS YGIDVCHLNL HKTFSIPHGG GGPGVGPIAV ADHLVPYLPG HSIIKCGGEK AISAVSAAPF GSAGILPISW MYIRMMGSDG LRKASSIAIL SANYLAKRLD PYYPVLFKDP NGLVAHECIL DLRPLKSQLG IEVEDVAKRL MDYGFHAPTI SWPVAGTLMV EPTESESLPE LDRFCDAMIG IREEIEQIKL GKIDPINNPL KQSPHTLKRV TSDDWDRPYS RKEAAYPLPD QEKYKFWPSV SRINNAYGDR NLICSCPSVQ DLEDINSV
|
| |