Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_3391 |
Symbol | nusA |
ID | 8392727 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | + |
Start bp | 3457613 |
End bp | 3458851 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 644981328 |
Product | transcription elongation factor NusA |
Protein accession | YP_003139054 |
Protein GI | 257061166 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0485206 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.931935 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCCTTG TGAGTTTGCC AGGGTTGGCA AATATGATTG AAGAAATTAG CCAACTACAC AACCTACCCA AATCCGCCGT ACAAGAAGCT TTACGAGAAG CCTTGCTTAA AGGCTATGAA CGCTACCGTC GTTCCCAAAA CTTAGAACGT CAGGCCTTTC ACGAAGATTA CTTCGACAAT TTTGAAGTGG AACTGGACAC CGAAGAAGAA GGGTTTCGTA TTCTATCGAC GAAAAAAATT GTCGAAGCCG TCGAGAATAC TGATCATTTT ATTTCCCTCG AAGAAGTCCA AGAAGTCGCT TCCGAAGCGC AACTGGGGGA TGAAGTCGTC TTAGATGTTA CCCCTAACCA AAAAGATTTT GGGCGCATGG CTGCCATTCA AACGAAACAG GTTCTGCTGC AAAAACTGCG AGATCAGCAG AGAAAACTGA TTCAAGAAGA ATTTAACGAA ATAGAAGGAA CGGTGCTCAA CGCCAGAGTT TTGCGGTTTG AGCGACAAGA CGCGATCGTT GCCGTGCAAA GTACCTTTGG ACAACCCGAA GTCGAAGCCG TTTTACCCAA ACGCGAACAA CTCCCCAACG ATAATTACCG AGCTAATGCC ACCTTTAAAG TCTTACTCAA AAAAGTCCGT GAAGGCTCCC ACCGAGGACC TCAATTGATT GTCTCCCGTT CGGCAGCCGG GTTAGTGGTA GATTTGTTTA CCGTTGAAGT CCCTGAAATT GAAGAAGAAA TCGTGCGGAT CGTGGCGGTT TCCCGCGAAG CGAACCCCCC TTCGCGCCAT GTAGGGCCGC GGACGAAAAT AGCCGTCGAT ACCCTCGAAA GGGATGTTGA TCCCGTGGGA GCGTGTATTG GAGCTAGGGG ATCGCGTATT CAAGCTGTCG TTAATGAACT GCGAGGAGAA AAAATCGATG TGATTCGGTG GTCTCCTGAT CCAGCTACTT ATATTGCCAA TGCGTTAAGT CCAGCTAGGG TGGATAATGT TATCTTAATT AATCCTGATG AACGTCATGC CCTTGTCTTA GTAGCCGAAG ATCAACTCAG TTTAGCCATT GGGAAAGAAG GGCAAAACGT GCGCCTAGCA GCTCGTTTGA CGGGATGGAA AATCGATATT AAGGATACAG CGACTTATCA AGCAGAAGTG GAACAAAAAA AACACGCACA GAATCAGACA GCCATTGATC CCCTAGAAGA AGACCCCAAT CTTCCCTTAA CCGAGATGTC AGAAACACAG ATAGGTTAG
|
Protein sequence | MSLVSLPGLA NMIEEISQLH NLPKSAVQEA LREALLKGYE RYRRSQNLER QAFHEDYFDN FEVELDTEEE GFRILSTKKI VEAVENTDHF ISLEEVQEVA SEAQLGDEVV LDVTPNQKDF GRMAAIQTKQ VLLQKLRDQQ RKLIQEEFNE IEGTVLNARV LRFERQDAIV AVQSTFGQPE VEAVLPKREQ LPNDNYRANA TFKVLLKKVR EGSHRGPQLI VSRSAAGLVV DLFTVEVPEI EEEIVRIVAV SREANPPSRH VGPRTKIAVD TLERDVDPVG ACIGARGSRI QAVVNELRGE KIDVIRWSPD PATYIANALS PARVDNVILI NPDERHALVL VAEDQLSLAI GKEGQNVRLA ARLTGWKIDI KDTATYQAEV EQKKHAQNQT AIDPLEEDPN LPLTEMSETQ IG
|
| |