Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Synpcc7942_2047 |
Symbol | |
ID | 3774266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 2118510 |
End bp | 2121371 |
Gene Length | 2862 bp |
Protein Length | 953 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637800492 |
Product | glycine dehydrogenase |
Protein accession | YP_401064 |
Protein GI | 81300856 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | [TIGR00461] glycine dehydrogenase (decarboxylating) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCTT CGCCCCACAA CTTTGCTCAG CGGCATCTCG GGCCACGGCC GGCCGATGTC GAGCAGATGT TGCAGAAGTT AGGTTGCGAG AGCCTAGAAG ACTTGCTGGC GGCGGTAGTC CCTGCAGATA TTCGTTTGCC ACGATCGCTG AATTTGCCTG AGCCCTGCAG TGAGGCCGAA GCGCTGGCGG AATTGCGAGC GATCGCCCAT CAAAATCAGA TCCTGCGCTC CTATCTTGGC CAAGGCTATG CCAACTGCCT GACGCCGCCT GTAATTCAGC GTAATATTCT CGAAAATCCG GGCTGGTACA CCGCCTACAC GCCCTACCAA GCCGAGATTG CCCAAGGACG CTTGGAAGCA CTGCTCAACT TCCAGACCAT GGTCAGCGAT CTGACGGGGC TGGAGATCGC CAACGCCTCC CTGTTGGATG AGGCAACAGC GGCAGCCGAA GCGATGACCC TCAGTTTGGC AGTCGCGAAG TCAAAGTCTC AGACCTACTT CGTTGCCCAC AACTGCCATC CGCAAACGAT CGCGGTGGTA CAGACTCGGG CTGCTGCACT GGGGATTGAA GTGCTCGTCG CAGATCTGCT GCAGTTTGAC TTCCAGACGC CGATTTTTGG ACTGCTGCTG CAATATCCCG CCACCGACGG CACGATCGCA GATTACCGCT CGGTGATTGA GCAAGCCCAT GCTCAGGGCG CGATCGCAAC CGTTGCCTGC GACTTGCTAG CACTAACCCT GCTGACCCCT CCAGGGGAAT TTGGCGCGGA TATCGCGGTT GGGAATAGTC AGCGGTTCGG CGTGCCCCTC GGTTACGGCG GTCCTCATGC GGCATTCTTT GCCACCAAGG AAGCCTACAA ACGGCAGATT CCGGGGCGGA TTGTCGGTGT CTCTAAGGAT GCCCAAGGTC AACCAGCACT GCGCTTGGCG TTGCAGACGC GGGAGCAACA TATTCGTCGC GACAAGGCCA CGAGCAATAT CTGCACGGCG CAGGTCTTGC TGGCTGTGGT GGCTGGCTTC TACGCGGTCT ACCACGGGGC AGAAGGACTG ACCGCGATCG CGAGGCAAGT GCGTCGCCAG ACTCAGATCT TGGCGGAGGA GTTGCAGTCT CTCGGATTCA AGATTCCTCA GCAGCCGGGC TTTGACACGC TGATCGTTGA GGTCGAAGAC CCGAAAGTTT GGCAGTCGCG AACTGAAGCA GCGGGTTTTA ATCTGCGTTG TCTGAGCGAT CGCCAGCTTG GTATCAGCCT CGATGAAACG ACGACTGACA GCGATCTCCT CGACCTGCTC ACTGTTTTTG CTCAAGGGCG ATCGTTGCCA GCTTGGGAGG ATCTACAAGC GGCTGTGACT GACGAAGTGG ATCCAGCCTT CGCCCGCCAA ACGCCCTTCC TGACCCATCC CGTCTTTCAG CAGTACCACT CGGAAACCGA GTTGCTGCGC TATATCCATC GCCTCCAGAG TCGTGATCTA TCGCTGACTA CAGCGATGAT TCCGCTCGGC TCCTGCACGA TGAAGCTCAA CGCCACGGCG GAGATGCTAC CGATCAGTTG GCCGGAGTTT AATCAGATTC ACCCCTTTGC ACCGCTGAGT CAAACCCAGG GTTATCAACA GCTGTTCCAG CAGCTTGAGT CTTGGCTAGC CGAAATTACG GGCTTCGCAG CGGTCTCCCT ACAACCCAAT GCTGGCTCTC AAGGGGAATA TGCGGGTCTA CTCGTCATCC AGCGCTACCA CCAGAGTCGC GGCGAAGATC ACCGCCAGAT TTGCCTGATT CCGCAGTCGG CTCACGGGAC TAATCCCGCC AGCGCGGTGA TGGCTGGCAT GAAAGTCGTG CCGATCGCCT GTGACGATCG CGGCAACATT GATGTCAGTG ACCTGCAGCA AAAAGCTGCC CAGTATGCGG ATCAGCTCGC AGCACTGATG GTCACCTATC CCTCTACTCA CGGCGTCTTT GAGGAAGCGA TCGCGGAGAT CTGTGCGATC GTTCATCAGC AGGGCGGCCA AGTTTATTTA GATGGTGCCA ATCTCAACGC CCAAGTCGGC CTCTGTCAGC CCGCCCAATT TGGGGCGGAT GTCTGTCATC TCAACCTCCA CAAGACCTTT TGCATTCCCC ACGGCGGTGG TGGCCCCGGC GTTGGCCCGA TCGGTGTTGC CGCGCACCTT GCGCCCTTCC TGCCGAGTCA TCCGCTCGTC CCAGAAGCGA ATGCCGATCC GCAAGCCCTT GGCCCGATCG CAGCCGCCCC TTGGGGGAGT GCCAGCATCC TGCCCATTTC TTGGATGTAT ATCCGCATGA TGGGTGCAGC TGGGTTGACG CAAGCCAGCG CAATCGCAAT TCTCAACGCC AACTACATTG CCACACGACT AGCGCCCTAC TATCCAATCC TCTATCGGGG CGATCGCGGC TTTGTTGCCC ACGAATGTAT CCTTGACCTA CGACCGCTCA AACGCACAGC CGGGATTGAA GTCGAGGATG TCGCCAAACG GCTGATGGAC TACGGCTTTC ATGCGCCAAC CATGTCTTGG CCCGTGCTCG GCACGTTGAT GGTCGAGCCA ACCGAGAGTG AATCGCTGGC AGAACTCGAT CGCTTCTGTG AAGCGATGAT CGGCATTTAT CACGAGGTGG ACGCGATCGC CAGCGGTGAC TTGGATCCCC TCGACAATCC CCTCAAGCAT GCGCCCCACC CGGCAGATGT GCTGCTCCAG TCTGACTGGA ATCGCGCCTA CAGCCGCGAG CAGGCCGCTT ATCCTGCCCC TTGGACGCGA GAACACAAAT TCTGGCCAGT GGTCAGCCGC ATCGATAACG CCTACGGCGA TCGCAATCTC GTCTGCTCCT GTCTACCCAT GAGCGCCTAC AGCGATCGCT GA
|
Protein sequence | MSASPHNFAQ RHLGPRPADV EQMLQKLGCE SLEDLLAAVV PADIRLPRSL NLPEPCSEAE ALAELRAIAH QNQILRSYLG QGYANCLTPP VIQRNILENP GWYTAYTPYQ AEIAQGRLEA LLNFQTMVSD LTGLEIANAS LLDEATAAAE AMTLSLAVAK SKSQTYFVAH NCHPQTIAVV QTRAAALGIE VLVADLLQFD FQTPIFGLLL QYPATDGTIA DYRSVIEQAH AQGAIATVAC DLLALTLLTP PGEFGADIAV GNSQRFGVPL GYGGPHAAFF ATKEAYKRQI PGRIVGVSKD AQGQPALRLA LQTREQHIRR DKATSNICTA QVLLAVVAGF YAVYHGAEGL TAIARQVRRQ TQILAEELQS LGFKIPQQPG FDTLIVEVED PKVWQSRTEA AGFNLRCLSD RQLGISLDET TTDSDLLDLL TVFAQGRSLP AWEDLQAAVT DEVDPAFARQ TPFLTHPVFQ QYHSETELLR YIHRLQSRDL SLTTAMIPLG SCTMKLNATA EMLPISWPEF NQIHPFAPLS QTQGYQQLFQ QLESWLAEIT GFAAVSLQPN AGSQGEYAGL LVIQRYHQSR GEDHRQICLI PQSAHGTNPA SAVMAGMKVV PIACDDRGNI DVSDLQQKAA QYADQLAALM VTYPSTHGVF EEAIAEICAI VHQQGGQVYL DGANLNAQVG LCQPAQFGAD VCHLNLHKTF CIPHGGGGPG VGPIGVAAHL APFLPSHPLV PEANADPQAL GPIAAAPWGS ASILPISWMY IRMMGAAGLT QASAIAILNA NYIATRLAPY YPILYRGDRG FVAHECILDL RPLKRTAGIE VEDVAKRLMD YGFHAPTMSW PVLGTLMVEP TESESLAELD RFCEAMIGIY HEVDAIASGD LDPLDNPLKH APHPADVLLQ SDWNRAYSRE QAAYPAPWTR EHKFWPVVSR IDNAYGDRNL VCSCLPMSAY SDR
|
| |