Gene Information |
Locus tag | Synpcc7942_1527 |
Symbol | |
ID | 3774951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | + |
Start bp | 1586243 |
End bp | 1588759 |
Gene Length | 2517 bp |
Protein Length | 838 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637799960 |
Product | nitrogen assimilation regulatory protein |
Protein accession | YP_400544 |
Protein GI | 81300336 |
COG category | [C] Energy production and conversion; [K] Transcription; [T] Signal transduction mechanisms |
COG ID | [COG0348] Polyferredoxin; [COG1221] Transcriptional regulators containing an AAA-type ATPase domain and a DNA-binding domain |
TIGRFAM ID | |
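
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration, and it assumes that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the stop codon. The record does not state either assumption, and the variable names are made up for this example.

```python
# Values copied from the Gene Information table above.
start_bp = 1586243
end_bp = 1588759
gene_length_bp = 2517
protein_length_aa = 838

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the final codon is the stop codon and is not counted in the protein.
expected_protein_aa = codons - 1

print("span (bp):", span_bp)
print("codons:", codons, "remainder:", remainder)
print("expected protein length (aa):", expected_protein_aa)
print("matches record:", span_bp == gene_length_bp
      and remainder == 0
      and expected_protein_aa == protein_length_aa)
```

Under these assumptions the three fields agree: 2517 bp is 839 codons, and dropping the stop codon leaves 838 residues.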
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.000360333 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
Sequence |
Gene sequence | ATGGAAACGA GCGACAAACA GCGCTGGCTG TTAGAGAACA CTTTGTTTAA TGGCCTGTCG GAGCGGGCAA TCGCCGCGAT CGCCACAGGA CTCTCGGAAG TTACGGTTCC CAGCGGTCAG GTTCTGTCTG ATCTCGACTC GTTACCGAAT GCGCTCTTGA TTTTGGTTCA CGGTGAACTC GATCGCCAGC AACCCATGAC AGGCCAGTGC GATCGCCTGT TGCCCGGTAG CGTACTCAAC CTGCGGGAAA TCCTGCTCCA GCAACCAGTC ACCCAGCAGG TCACGACAGC AACAGATGTC CTTGTCTGGC AACTGTCGGC CGAGCAACTG CAAGCGATCG CCACAGAGCT GAATGAACTT GATCGCTACC TCTCCGCACA ATTGGCAGCA GAGCTAGATG CGGTCACCGC TCAACTGCGT TTTGAACAGG CACGGTTGCG GGAGTTGCAG CCCTACGTTA TCCCCAAAAC TAAGCGGGGG ATTGTCGGTA GTAGTCGCTA TGCCCAACGG CTACGACAGG AAATTCGCCA AGCCTCGATC CGCAACGATC GTCAGCCGGT ATTGATTTTT GGCGAGCCAG GTCTGGGTAA AGACAACATT GCTGCCTTGA TCCACTTCGG TTCGCGGGAT CGACGTGAGC CCCTGATCAA AATCAACTGC AATACGCTGC AGCCCAATGG TGCGGAGCTA TTTGGCCGTA TTAATGAGCG ACGGGGACTG CTCGATTGGG TGGGCAAGGG CACAGTGCTG CTCAACAACG TCCAAGACTT ACCAACTGAT TTGCGATCGC GGGTGATTGA GCTCCTCGCC ACTGGCTACT ATCGCCCGCT ACCAACCCTG CAAGTACCGG AGCCAGAGCC CCAAGCGTGC CTAGCGCGCC TGATTTTGGT GGCGGAAACT AACCCTACTG ACTTGGCACG GCACTGCGTT CAAACCATCA AGGTGCCGCC TCTGCGGATC CGGAAAGCCG ATATCGTCGC CAGCGTCAAA TACTTCCTCA GTCGCTTTTG CCAAACGCGA CGCCAGCCTC GCCCGAAGCT AACGCCGGAA GCCGAACGCC AGCTGCAGAA CTACGACTAT CCCGGCAATA TTACTGAGCT GGAAAGTCTG GTGGAGCGGG CTTTAGTACA AAGCGGACAG GCTGCCGTGC TGACCGAAGA TGTCTTTTGG TTTGCCTCCA CGAAAGGCGA TCGCTTCCGC TGGAACCTGC TCAATGCCTA TCCTCGCCTG CGGCAACTGC TACGCAGCGA CTGGTGGCCA ACGCGGATCA ACTACGGCCT GATTCTCGGG GTGTATACGG CTGTGGTTGC CCTGCTGTTT TGGGGTCCGC AAACCCGCGC TGAGAACGTG GGGTTAACCC TGTTTTGGGC AGGCTGGTGG CCTCTGATTT TGCTGGCGTT TCCCTTTGTG GGGCGGCTTT GGTGTGCCTA TTGCCCCTTC ATGATCTACG GCGAACTGGT GCAGTGGGTT TCCCTGAAGC TCTGGCCGCG ATCGCTCTTG CCCTGGCCAC GGGCGGCAGC AGAGCGCTGG GGTGGCTGGT TCCTGTTTGG CCTCTTCGCC CTGATTTTGC TCTGGGAGGA GCTCTGGCAC CTCGAAGACG TTGCTTGGCT GTCGGCCTGT CTCTTGCTGC TGATTACCGC TGGGGCTGTT ACTTTTTCGC TGCTATTCGA GCGACGTTTT TGGTGTCGCT ATCTCTGCCC GATCGGTGGT ATGAATGGCC TCTTCGCCAA ATTGGCGGTA ATTGAACTGC GGGCCAAACG CGGTGTCTGC TCTGCCACCT GCAACACCTA TCAATGCTAC 
AAAGGTGGCC CTGCAAAAGG GGAAGGCCAA GAGACCATGG GTTGCCCGGT CTATTCCCAT CCGGCTCAGT TGGTGGATAA CCGCAACTGC GTTCTCTGCA TGACCTGTCT CAAGGCCTGC CCCCATCGCT CGGTTGAGCT GAATCTCCGG CCACCGGCGA TCGAACTATG GACCACCCAT GTCGCGACAC GATCGGAAGC TGCTCTGCTC TTCCTCTTGT TGGGTGCAGT GTTTCTGCAT CATCTGCCCC AGATTGCTCA GGTTCTAGGT TTAGGCGATC GCTGGCTCAC TTCCTTGGGA CTGCATGCCA TTTTGGCGAC GGCTGTCCTG GGTACGCCGG GGTTACTGGC CTTTTTCAGC GATCGCATTT TGCACGCTTG GCAACCCCGC CTCAAATCCT TTACCGAACT GGCCTATGGC TATCTGCCGC TGGTGTTGGC GGCGAGCCTC GCTCACTACC TTTGGATGGG GTTAACAGAA CTGGGACAAG TCCTACCACG CACGGCCCTT AGCTTTGGCT GGTCACCCAC CAATCTCCTG CAATACAGTG CTGACCCTGC CGTGATTGCC TTCTTGCAGG CGAGCAGCTT GATCTTAGGG CTCGTCACCA CCTTATTGGT CACTCAAAAA ATCGCCCGCC AGCCCTGGCG ATCGCTGCTG CCCCAACATA GTCTGGCCGT GGGGTTCACC AGCCTGCTCT GGCAGTTGAT TGTCTAG
|
Protein sequence | METSDKQRWL LENTLFNGLS ERAIAAIATG LSEVTVPSGQ VLSDLDSLPN ALLILVHGEL DRQQPMTGQC DRLLPGSVLN LREILLQQPV TQQVTTATDV LVWQLSAEQL QAIATELNEL DRYLSAQLAA ELDAVTAQLR FEQARLRELQ PYVIPKTKRG IVGSSRYAQR LRQEIRQASI RNDRQPVLIF GEPGLGKDNI AALIHFGSRD RREPLIKINC NTLQPNGAEL FGRINERRGL LDWVGKGTVL LNNVQDLPTD LRSRVIELLA TGYYRPLPTL QVPEPEPQAC LARLILVAET NPTDLARHCV QTIKVPPLRI RKADIVASVK YFLSRFCQTR RQPRPKLTPE AERQLQNYDY PGNITELESL VERALVQSGQ AAVLTEDVFW FASTKGDRFR WNLLNAYPRL RQLLRSDWWP TRINYGLILG VYTAVVALLF WGPQTRAENV GLTLFWAGWW PLILLAFPFV GRLWCAYCPF MIYGELVQWV SLKLWPRSLL PWPRAAAERW GGWFLFGLFA LILLWEELWH LEDVAWLSAC LLLLITAGAV TFSLLFERRF WCRYLCPIGG MNGLFAKLAV IELRAKRGVC SATCNTYQCY KGGPAKGEGQ ETMGCPVYSH PAQLVDNRNC VLCMTCLKAC PHRSVELNLR PPAIELWTTH VATRSEAALL FLLLGAVFLH HLPQIAQVLG LGDRWLTSLG LHAILATAVL GTPGLLAFFS DRILHAWQPR LKSFTELAYG YLPLVLAASL AHYLWMGLTE LGQVLPRTAL SFGWSPTNLL QYSADPAVIA FLQASSLILG LVTTLLVTQK IARQPWRSLL PQHSLAVGFT SLLWQLIV