Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0368 |
Symbol | nusA |
ID | 4569346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 410366 |
End bp | 411922 |
Gene Length | 1557 bp |
Protein Length | 518 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 639764966 |
Product | transcription elongation factor NusA |
Protein accession | YP_910851 |
Protein GI | 119356207 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00190694 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCAGAA AGCAGGTAAA AACGGAGGGG CAGGACAGGA AAGCGCAGAT TGCCAGTGCT TTCGGAGAAA TCGAGCAGTC CAAAATCTTT CTGGACAAAC GTACGGAAAG TGCGGCTGTC AAAATGGATA TTGCAGACCT TCTTAAAGAT ATCATCCAGA AGCAGCTCAG AAAGGATTAT GATCCTGAAG TTGAGGCTAA TATTTTCATT AATCCCGAGC GCGGCGATTT TGAGGTTTAT ATTCTGAAAA AGGTTGTCAG TGAAGTTGAT CTCGAAAGCA TTGAAATCAG CATCGATGAA GTCAGAAAAA TAGACGATTC TCTTGAACTC GGAGATTATT ATGAAGAGGG CCCGATCAAG CTTGAGGATT ATCTGAGCAG AAAGTCTATC CAGATTATCA AGCAGTCTGT ACAGAAAAAG GTACGGGATC TTGAGCGTCT TGTGGTGTAT GAGGAGTGCC TTGAAAAAGT TGGCGAGGTT GTGGCCGGTG AAGTATATCA GGTTCGGTCA AACGAAGTGA TATTTACCTA CAACACATCA AAAGATCATC GTGTTGAGCT TGTTCTGCCA AGAGCCGAAA TGATAAAAAA GGATAATCCT CGCAGAAACC CGAGGATGAA GCTCTACGTC AAGCGCATCG AACGCGAGAA GGTTAAGGTA AGGCTTGATG ACGGCGGCGT TGTCGAGCGC GATAAACCTG ATGGGGGAAT GAAAGTAATT GTTTCAAGGA TTGACGATCG CTTTTTGTAC AAACTGTTTG AACATGAGGT TCCGGAAATT CTCGATGGTC TTATTGTGAT AAAAGGAATT GCCCGTGTTC CAGGCGAAAG GGCAAAGGTT GCCGTTGAGT CCACCAGTGC CAGAATAGAT CCTGTAGGAG CAAGCGTAGG GTATCGCGGA AAAAGGATAC AGAGTATCGT CAAGGAACTG AACAATGAGA ATATCGATGT TATTTATTAT ACCGATGAAC CGCAGATTTA TATAGCTCGT GCTTTGCAGC CAGCCAAGAT TGATCCCTTG ACGGTTCATG CCGATATTAA AACCCGAAAG GCAAGGGTAA TGCTGAAGCC GGACCAGATC AAGTATGCCA TTGGAAAAAA CGGTAATAAC ATACATCTGG CCGAAAAGCT TACCGGATAT GAAATCGATG TGTATCGCGA TGTGATGGAC AAGTCGATGG AAGATCCTAC TGATATTGAT ATTATTGAGT TCAGGGAAGA GTTTGGCGAT GATATGATCT ATCAATTGCT TGACGGTGGC CTTGATACGG CCAAGAAGGT ACTGAAAGCA GGTGTTGAAA GGATTGAGGA GGTTTTGCTC GGCCCTTCAG CTCCAGAGGA GATTACTTTT TTTTCAAAAG GTCGAACCAG GAGCCCGATT AAGCCAAGAG AGCGAAAGGT AACCGAAGAG GAAAAGCGGT ACTGGAAAAA AATTGCCGAA AATATTTTTA AAACCGTTAA AGAGCAGTTT ACTGATGCTG ATTTGCATGA TCTGTTCGAC GATGGAGATG AAGACAGTGA CGATCAACAA GCCGCTGAAA GTCCGGCAGA TGAATGA
|
Protein sequence | MVRKQVKTEG QDRKAQIASA FGEIEQSKIF LDKRTESAAV KMDIADLLKD IIQKQLRKDY DPEVEANIFI NPERGDFEVY ILKKVVSEVD LESIEISIDE VRKIDDSLEL GDYYEEGPIK LEDYLSRKSI QIIKQSVQKK VRDLERLVVY EECLEKVGEV VAGEVYQVRS NEVIFTYNTS KDHRVELVLP RAEMIKKDNP RRNPRMKLYV KRIEREKVKV RLDDGGVVER DKPDGGMKVI VSRIDDRFLY KLFEHEVPEI LDGLIVIKGI ARVPGERAKV AVESTSARID PVGASVGYRG KRIQSIVKEL NNENIDVIYY TDEPQIYIAR ALQPAKIDPL TVHADIKTRK ARVMLKPDQI KYAIGKNGNN IHLAEKLTGY EIDVYRDVMD KSMEDPTDID IIEFREEFGD DMIYQLLDGG LDTAKKVLKA GVERIEEVLL GPSAPEEITF FSKGRTRSPI KPRERKVTEE EKRYWKKIAE NIFKTVKEQF TDADLHDLFD DGDEDSDDQQ AAESPADE
|
| |