Gene Synpcc7942_1420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSynpcc7942_1420 
Symbol 
ID3773592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus elongatus PCC 7942 
KingdomBacteria 
Replicon accessionNC_007604 
Strand
Start bp1473509 
End bp1474909 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content55% 
IMG OID637799852 
Productlight-independent protochlorophyllide reductase subunit N 
Protein accessionYP_400437 
Protein GI81300229 
COG category[C] Energy production and conversion 
COG ID[COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains 
TIGRFAM ID[TIGR01279] light-independent protochlorophyllide reductase, N subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.172263 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACTA CCGAAGCACC TTCAGCACTC TCTTTTGAGT GTGAAACTGG CAACTATCAC 
ACTTTTTGTC CCATTAGTTG CGTGGCTTGG CTTTATCAAA AAATTGAAGA TAGTTTCTTC
TTGGTGATTG GCACAAAAAC CTGCGGCTAT TTTTTGCAAA ATGCCATGGG TGTGATGATC
TTTGCAGAGC CACGCTATGC GATGGCAGAG CTGGAAGAGG GAGACATTTC CGCTCAGCTT
AATGACTATG CAGAACTCAA ACGACTCTGC ACACAAATCA AACGCGATCG CAATCCCAGC
GTAATTGTTT GGATTGGTAC TTGCACGACT GAAATTATCA AGATGGATCT GGAGGGACTC
GCTCCCAAGC TAGAAGCTGA GATTGGCATT CCGATCGTGG TCGCTCGTGC CAATGGTTTG
GACTATGCCT TCACCCAAGG CGAGGACACC GTGTTAGCTG CTATGGCCGC TCGCTGCCCT
GAGGCTGCTA CCAGCGAAGC AGATCAACAA GAACGGACTA ACGCTATCCA GCGTCTGCTC
CAGTTTGGGA AATCCCCTGC CGCCGAGCAG CAGCCTGCTA GTTCCAAGCA CCCGCCCCTG
ATCCTGTTTG GCTCGGTGCC CGATCCTGTT GCCACCCAAC TCACGATCGA GCTGGCGAAG
CAAGGGATTA CGGTCTCGGG TTGGTTGCCC GCTAAGCGCT ATACCGAACT ACCGGTCATC
GCTGAAGGGA GTTACGCGAT CGGCTTGAAT CCGTTTTTGT CCAGAACAGC TACAACCCTG
ATGCGCCGCC GTAAATGCAA AGTGATCGGC GCTCCCTTCC CCATTGGACC CGATGGCAGT
CGCGCTTGGA TCGAGAAAAT CTGCAGCGTT CTGGAGATTG AGCCCCAAGG CTTAGCTGAG
CGGGAAGCTC AAGTTTGGGA CAGCATCGAA GACTATCGTC AGCTTGTCGA GGGCAAACAA
GTCTTCTTCA TGGGCGACAA CCTCTGGGAA ATTTCCCTGG CTCGCTTCCT GGTCCGCTGC
GGGATGCGCT GTCCTGAAAT TGGCATCCCC TACCTCGATC GCCGCTACCT AGGGGCTGAG
CTGGCAATGC TTGAAGCCAC CTGCCAAAGC ATGGGAGTCC CCCTACCACG CTTGGTTGAG
AAACCGGACA ACTACAACCA ACTGCAGCGG ATCGAGGCAC TACAGCCCGA CCTAGTCATT
ACCGGCATGG CCCACGCTAA TCCTCTAGAG GCTCGCGGCA TCAGCACCAA GTGGTCGGTG
GAGTTCACCT TTGCCCAGAT CCATGGCTTC GGCAATGCCC GCGCCATCCT AGAGCTGGTG
ACTCGCCCCC TCCGTCGTAA CCTCGCATTG GGCACATTGG GCGGCAGTCA ATGGGTGAGC
GAAGCTGTTA CCTCACGCTA G
 
Protein sequence
MTTTEAPSAL SFECETGNYH TFCPISCVAW LYQKIEDSFF LVIGTKTCGY FLQNAMGVMI 
FAEPRYAMAE LEEGDISAQL NDYAELKRLC TQIKRDRNPS VIVWIGTCTT EIIKMDLEGL
APKLEAEIGI PIVVARANGL DYAFTQGEDT VLAAMAARCP EAATSEADQQ ERTNAIQRLL
QFGKSPAAEQ QPASSKHPPL ILFGSVPDPV ATQLTIELAK QGITVSGWLP AKRYTELPVI
AEGSYAIGLN PFLSRTATTL MRRRKCKVIG APFPIGPDGS RAWIEKICSV LEIEPQGLAE
REAQVWDSIE DYRQLVEGKQ VFFMGDNLWE ISLARFLVRC GMRCPEIGIP YLDRRYLGAE
LAMLEATCQS MGVPLPRLVE KPDNYNQLQR IEALQPDLVI TGMAHANPLE ARGISTKWSV
EFTFAQIHGF GNARAILELV TRPLRRNLAL GTLGGSQWVS EAVTSR