Gene Synpcc7942_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_2047 
Symbol 
ID3774266 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp2118510 
End bp2121371 
Gene Length2862 bp 
Protein Length953 aa 
Translation table11 
GC content58% 
IMG OID637800492 
Productglycine dehydrogenase 
Protein accessionYP_401064 
Protein GI81300856 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain
[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID[TIGR00461] glycine dehydrogenase (decarboxylating) 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGCTT CGCCCCACAA CTTTGCTCAG CGGCATCTCG GGCCACGGCC GGCCGATGTC 
GAGCAGATGT TGCAGAAGTT AGGTTGCGAG AGCCTAGAAG ACTTGCTGGC GGCGGTAGTC
CCTGCAGATA TTCGTTTGCC ACGATCGCTG AATTTGCCTG AGCCCTGCAG TGAGGCCGAA
GCGCTGGCGG AATTGCGAGC GATCGCCCAT CAAAATCAGA TCCTGCGCTC CTATCTTGGC
CAAGGCTATG CCAACTGCCT GACGCCGCCT GTAATTCAGC GTAATATTCT CGAAAATCCG
GGCTGGTACA CCGCCTACAC GCCCTACCAA GCCGAGATTG CCCAAGGACG CTTGGAAGCA
CTGCTCAACT TCCAGACCAT GGTCAGCGAT CTGACGGGGC TGGAGATCGC CAACGCCTCC
CTGTTGGATG AGGCAACAGC GGCAGCCGAA GCGATGACCC TCAGTTTGGC AGTCGCGAAG
TCAAAGTCTC AGACCTACTT CGTTGCCCAC AACTGCCATC CGCAAACGAT CGCGGTGGTA
CAGACTCGGG CTGCTGCACT GGGGATTGAA GTGCTCGTCG CAGATCTGCT GCAGTTTGAC
TTCCAGACGC CGATTTTTGG ACTGCTGCTG CAATATCCCG CCACCGACGG CACGATCGCA
GATTACCGCT CGGTGATTGA GCAAGCCCAT GCTCAGGGCG CGATCGCAAC CGTTGCCTGC
GACTTGCTAG CACTAACCCT GCTGACCCCT CCAGGGGAAT TTGGCGCGGA TATCGCGGTT
GGGAATAGTC AGCGGTTCGG CGTGCCCCTC GGTTACGGCG GTCCTCATGC GGCATTCTTT
GCCACCAAGG AAGCCTACAA ACGGCAGATT CCGGGGCGGA TTGTCGGTGT CTCTAAGGAT
GCCCAAGGTC AACCAGCACT GCGCTTGGCG TTGCAGACGC GGGAGCAACA TATTCGTCGC
GACAAGGCCA CGAGCAATAT CTGCACGGCG CAGGTCTTGC TGGCTGTGGT GGCTGGCTTC
TACGCGGTCT ACCACGGGGC AGAAGGACTG ACCGCGATCG CGAGGCAAGT GCGTCGCCAG
ACTCAGATCT TGGCGGAGGA GTTGCAGTCT CTCGGATTCA AGATTCCTCA GCAGCCGGGC
TTTGACACGC TGATCGTTGA GGTCGAAGAC CCGAAAGTTT GGCAGTCGCG AACTGAAGCA
GCGGGTTTTA ATCTGCGTTG TCTGAGCGAT CGCCAGCTTG GTATCAGCCT CGATGAAACG
ACGACTGACA GCGATCTCCT CGACCTGCTC ACTGTTTTTG CTCAAGGGCG ATCGTTGCCA
GCTTGGGAGG ATCTACAAGC GGCTGTGACT GACGAAGTGG ATCCAGCCTT CGCCCGCCAA
ACGCCCTTCC TGACCCATCC CGTCTTTCAG CAGTACCACT CGGAAACCGA GTTGCTGCGC
TATATCCATC GCCTCCAGAG TCGTGATCTA TCGCTGACTA CAGCGATGAT TCCGCTCGGC
TCCTGCACGA TGAAGCTCAA CGCCACGGCG GAGATGCTAC CGATCAGTTG GCCGGAGTTT
AATCAGATTC ACCCCTTTGC ACCGCTGAGT CAAACCCAGG GTTATCAACA GCTGTTCCAG
CAGCTTGAGT CTTGGCTAGC CGAAATTACG GGCTTCGCAG CGGTCTCCCT ACAACCCAAT
GCTGGCTCTC AAGGGGAATA TGCGGGTCTA CTCGTCATCC AGCGCTACCA CCAGAGTCGC
GGCGAAGATC ACCGCCAGAT TTGCCTGATT CCGCAGTCGG CTCACGGGAC TAATCCCGCC
AGCGCGGTGA TGGCTGGCAT GAAAGTCGTG CCGATCGCCT GTGACGATCG CGGCAACATT
GATGTCAGTG ACCTGCAGCA AAAAGCTGCC CAGTATGCGG ATCAGCTCGC AGCACTGATG
GTCACCTATC CCTCTACTCA CGGCGTCTTT GAGGAAGCGA TCGCGGAGAT CTGTGCGATC
GTTCATCAGC AGGGCGGCCA AGTTTATTTA GATGGTGCCA ATCTCAACGC CCAAGTCGGC
CTCTGTCAGC CCGCCCAATT TGGGGCGGAT GTCTGTCATC TCAACCTCCA CAAGACCTTT
TGCATTCCCC ACGGCGGTGG TGGCCCCGGC GTTGGCCCGA TCGGTGTTGC CGCGCACCTT
GCGCCCTTCC TGCCGAGTCA TCCGCTCGTC CCAGAAGCGA ATGCCGATCC GCAAGCCCTT
GGCCCGATCG CAGCCGCCCC TTGGGGGAGT GCCAGCATCC TGCCCATTTC TTGGATGTAT
ATCCGCATGA TGGGTGCAGC TGGGTTGACG CAAGCCAGCG CAATCGCAAT TCTCAACGCC
AACTACATTG CCACACGACT AGCGCCCTAC TATCCAATCC TCTATCGGGG CGATCGCGGC
TTTGTTGCCC ACGAATGTAT CCTTGACCTA CGACCGCTCA AACGCACAGC CGGGATTGAA
GTCGAGGATG TCGCCAAACG GCTGATGGAC TACGGCTTTC ATGCGCCAAC CATGTCTTGG
CCCGTGCTCG GCACGTTGAT GGTCGAGCCA ACCGAGAGTG AATCGCTGGC AGAACTCGAT
CGCTTCTGTG AAGCGATGAT CGGCATTTAT CACGAGGTGG ACGCGATCGC CAGCGGTGAC
TTGGATCCCC TCGACAATCC CCTCAAGCAT GCGCCCCACC CGGCAGATGT GCTGCTCCAG
TCTGACTGGA ATCGCGCCTA CAGCCGCGAG CAGGCCGCTT ATCCTGCCCC TTGGACGCGA
GAACACAAAT TCTGGCCAGT GGTCAGCCGC ATCGATAACG CCTACGGCGA TCGCAATCTC
GTCTGCTCCT GTCTACCCAT GAGCGCCTAC AGCGATCGCT GA
 
Protein sequence
MSASPHNFAQ RHLGPRPADV EQMLQKLGCE SLEDLLAAVV PADIRLPRSL NLPEPCSEAE 
ALAELRAIAH QNQILRSYLG QGYANCLTPP VIQRNILENP GWYTAYTPYQ AEIAQGRLEA
LLNFQTMVSD LTGLEIANAS LLDEATAAAE AMTLSLAVAK SKSQTYFVAH NCHPQTIAVV
QTRAAALGIE VLVADLLQFD FQTPIFGLLL QYPATDGTIA DYRSVIEQAH AQGAIATVAC
DLLALTLLTP PGEFGADIAV GNSQRFGVPL GYGGPHAAFF ATKEAYKRQI PGRIVGVSKD
AQGQPALRLA LQTREQHIRR DKATSNICTA QVLLAVVAGF YAVYHGAEGL TAIARQVRRQ
TQILAEELQS LGFKIPQQPG FDTLIVEVED PKVWQSRTEA AGFNLRCLSD RQLGISLDET
TTDSDLLDLL TVFAQGRSLP AWEDLQAAVT DEVDPAFARQ TPFLTHPVFQ QYHSETELLR
YIHRLQSRDL SLTTAMIPLG SCTMKLNATA EMLPISWPEF NQIHPFAPLS QTQGYQQLFQ
QLESWLAEIT GFAAVSLQPN AGSQGEYAGL LVIQRYHQSR GEDHRQICLI PQSAHGTNPA
SAVMAGMKVV PIACDDRGNI DVSDLQQKAA QYADQLAALM VTYPSTHGVF EEAIAEICAI
VHQQGGQVYL DGANLNAQVG LCQPAQFGAD VCHLNLHKTF CIPHGGGGPG VGPIGVAAHL
APFLPSHPLV PEANADPQAL GPIAAAPWGS ASILPISWMY IRMMGAAGLT QASAIAILNA
NYIATRLAPY YPILYRGDRG FVAHECILDL RPLKRTAGIE VEDVAKRLMD YGFHAPTMSW
PVLGTLMVEP TESESLAELD RFCEAMIGIY HEVDAIASGD LDPLDNPLKH APHPADVLLQ
SDWNRAYSRE QAAYPAPWTR EHKFWPVVSR IDNAYGDRNL VCSCLPMSAY SDR