Gene Information |
Locus tag | Synpcc7942_2022 |
Symbol | nusA |
ID | 3774209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | - |
Start bp | 2090378 |
End bp | 2091703 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637800467 |
Product | transcription elongation factor NusA |
Protein accession | YP_401039 |
Protein GI | 81300831 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
|
|
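Note: the length fields in the table above are consistent with the listed coordinates. A minimal sketch of that arithmetic follows, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the database record.

```python
# Values copied from the Gene Information table.
START_BP = 2090378
END_BP = 2091703


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (both ends count)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1326 441
```

Both results match the Gene Length (1326 bp) and Protein Length (441 aa) fields.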
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.142374 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAATGG TCACCCTGCC CGGCCTAGAG CAGCTGATCT ACGCCATTAG CGAGCAAAAA AAACTGCCCG CCAATGTCAT TGAAGAAGCC CTCAAAGAAG CCTTGCTCAA GGGCTACGAG CGCTATCGCC GTACCCAGCA GATGGGTGAG CAGTTTGAAG AAGACTACTT CGACAACATT GACGTTGAAC TCGATGTCGA ACAGGAAGGC TTTCGGGTAC TGGCAACCAA AACCATCGTC AATCAGGTCG AAAATCCTGA CCATCAGATT GCCCTCGCCG ATGTTCAGGA AGTGGCTCCA GATGCCCAAG CAGGCGAAAT CGTCGTTCTA GATGTCACAC CCGATAAAGA CGACTTTGGG CGAATGGCGG CTATTCAGAC TAAGCAAGTC CTGTCGCAAA AACTGCGCGA TCACCAGCGC AAACTGATCC AAGAAGAGTT CCAAGATCTA GAAGATCCGG TCTTGATGGC CAAGGTGCTG CGCTTCGAGC GCCAGTCTGT GATCTTGGGG GTCAGCAGTG GTTTAGGACG TCCTGAAGTC GAGGCAGAAC TGCCCCGTCG CGAACAACTG CCCAACGATA ACTACCGGGC CAACGCCACC TTCCGCGTCT TCCTCAAGGA AGTCAGTGAA GTACCCCGTC GTGGACCGCA GTTGATTGTC TCTCGGGCTA ACGCCGGTCT GGTTGTCTAC TTGTTCGAAA ACGAAGTTCC TGAAATCCAA GATGGTGTCG TCCGCATTGT GGCGGTAGCG CGGGAAGCGA ATCCGCCGAC TCGGCATGTT GGGCCGCGCA CCAAAATCGC TGTCGATACC TTGGAACGCG AAGTCGATCC GGTTGGGGCT TGCATCGGGG CGCGGGGATC GCGGATTCAG GTGGTCGTCA ACGAATTGCG GGGCGAAAAA ATTGATGTGA TCCGCTGGTC GCCGGATCCG GCCACCTATA TTGCCAATGC CCTCAGCCCT GCCCGGGTTC AGGAAGTGCG CTTGGTCGAT CCCGAAGGTC GGATCGCCCA CGTTTTGGTC AACGACGACC AACTCAGTCT CGCGATCGGC AAAGAGGGTC AGAACGTGCG CCTCGCGGCT CGACTGACCG GCTGGAAAAT CGACATCAAG GACGTGGCGC TCTACGACGC AGTCACGGAA GGTCAACGGA TTTCTGAACT GATTCAAGAA CGCCAAGAGC GGGCGGCGAT TGCTGCCGAA GAAGAAGCCC GTGCTGCCGC CGAAGCTGCT GAACTGGCGG AATGGGAGGC GGAAGAGGCT GCTCTCGCAG CCGCTGAAGC TGCGGCAGAA CTCGCAGCTG CTGAAGCTGA GGAAGAGACT GTTTGA
|
Protein sequence | MSMVTLPGLE QLIYAISEQK KLPANVIEEA LKEALLKGYE RYRRTQQMGE QFEEDYFDNI DVELDVEQEG FRVLATKTIV NQVENPDHQI ALADVQEVAP DAQAGEIVVL DVTPDKDDFG RMAAIQTKQV LSQKLRDHQR KLIQEEFQDL EDPVLMAKVL RFERQSVILG VSSGLGRPEV EAELPRREQL PNDNYRANAT FRVFLKEVSE VPRRGPQLIV SRANAGLVVY LFENEVPEIQ DGVVRIVAVA REANPPTRHV GPRTKIAVDT LEREVDPVGA CIGARGSRIQ VVVNELRGEK IDVIRWSPDP ATYIANALSP ARVQEVRLVD PEGRIAHVLV NDDQLSLAIG KEGQNVRLAA RLTGWKIDIK DVALYDAVTE GQRISELIQE RQERAAIAAE EEARAAAEAA ELAEWEAEEA ALAAAEAAAE LAAAEAEEET V
|
| |
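Note: the gene sequence above is listed 5' to 3' in coding orientation (it begins with ATG and ends with TGA), so it can be translated directly even though the gene lies on the minus strand. The sketch below is an illustrative, hand-rolled translation. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code, and it ignores the table's alternative start codons. The helper names `translate` and `gc_fraction` are hypothetical and do not come from the source database.

```python
from itertools import product

# Codon table built in TCAG order; amino acid assignments of translation
# table 11 match the standard code ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in tens and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGTCAATGG TCACCCTGCC CGGCCTAGAG"))  # prints: MSMVTLPGLE
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the GC content field.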