Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0401 |
Symbol | nusA |
ID | 6374063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 424695 |
End bp | 426260 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 642682918 |
Product | transcription elongation factor NusA |
Protein accession | YP_001958847 |
Protein GI | 189499377 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000166 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.293714 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAGAA AGCAGGTAAA AAAAGAAAAG CAGGATCGTA GAGAGCTTAT CGCAAATGCT TTTGGTGAAA TTGAGCAGTC GAAGGTTTTT CTGGAAAAAC ATACGGAGAG CGCCGCAGTG AAGATGGATA TTGCGGATCT TCTGAAGGAT ATTATACAGA AGCAGTTGCG CAAGGACTAC GACCCCGAGG TTGAGGCTAA TATTTTTATC AATCCGGAGC GGGGCGACTT CGAGGTGTAT ATTCTGAAAA CAGTTGTTGA TGAGGTTGAT CTTCCTTCCA TTGAGATCGG ACTTGAGGAG GTTAGCAGGA TCGACGAGTC TCTTGAGCTT GGGGACATGT ATGAAGAGGG TCCTGTCAAC CTCGAGGATT ATCTGACCAG AAAGTCGATA CAGATAATCA AGCAGTCGGT TCAGAAAAAG GTGCGGGACA TGGAAAAGCA GGTGGTGTAT GAGGATTGTC TCGAGAAAGT TGGTGAAGTT GTCGCAGGGG AGGTCTATCA GGTCAGGCAG AACGAGGTTA TTTTTTCCTA CAACACCTCA AAGGATCACC GTGTCGAACT GGTACTTCCG AAATCCGAAA TGATGAAAAA GGACAATCCA AGGCGCACCC CGAGGATGAA GCTCTATGTG AAAAGGATAG AGAGAGAAAA AGTGAAAGTC AGACAGGATG ATGGCTCCAT TGTCGAGAAA GAGAAGCCTG ATGGCGGGAT GAAGGTTATT GTTTCCAGAA TCGACGACCG GTTTCTCTAC AAGCTTTTTG AAAGTGAAGT GCCTGAAATT CTGGATGGGC TTATTGTCAT CAAGGGGATT GCAAGAGTTC CGGGTGAACG GGCCAAGGTT GCTGTTGAGT CGACAAGTTC GCGGATAGAT CCTGTAGGGG CGAGCGTGGG ATACCGGGGA AAGCGTATTC AGAGTATAGT CAAGGAACTC AACAATGAGA ATATCGATGT TATCAATTTT ACCGACGATC CACAGATCTA TATCGCCCGG GCACTTCAGC CGGCGAAAAT CGATCCTATG ACAGTTCATG CGGATATGAA GACGCACAAA GCCAGAGTGA TGCTCAAACC TGAGCAGATC AAGTACGCCA TAGGTAAAAA CGGTAACAAT ATTCATTTGG CCGAGCGTCT TACCGGTTAT GATGTGGATG TCTACAGGGA CGTTATCGAT AAATCCATGG AGGATCCTAA CGATATCGAT ATTATAGAGT TCCGCGAAGA ATTCGGCGAT GATATGATCT ACCAGCTTCT CGACAGCGGG CTTGATACAG CCAAAAAAGT GCTCAAGGCT GAAATAGAAG ATATTGAAGC TGCTCTGGTA GGGCCGCCTT CTAAAAGCGA GGAATCAGCG TTCTTCACCA AAGGAAGAAA AGCTCCGTTC AAACCCAAAG AGAGAACGCT CAGCGAAGAT GAGAAACGGT ATTGGAAAAA GATCGCTGAA AATATTTACA AGACAGTGAA GGAGCAGTTT AACGAGGCGG ATCTGCAGGA GATAATAGAT GAAGAGGACG AAGATATACT GAGTGATGGC GATGCAGATG TGAGCGTTGA TGATCAGAAC AACTGA
|
Protein sequence | MARKQVKKEK QDRRELIANA FGEIEQSKVF LEKHTESAAV KMDIADLLKD IIQKQLRKDY DPEVEANIFI NPERGDFEVY ILKTVVDEVD LPSIEIGLEE VSRIDESLEL GDMYEEGPVN LEDYLTRKSI QIIKQSVQKK VRDMEKQVVY EDCLEKVGEV VAGEVYQVRQ NEVIFSYNTS KDHRVELVLP KSEMMKKDNP RRTPRMKLYV KRIEREKVKV RQDDGSIVEK EKPDGGMKVI VSRIDDRFLY KLFESEVPEI LDGLIVIKGI ARVPGERAKV AVESTSSRID PVGASVGYRG KRIQSIVKEL NNENIDVINF TDDPQIYIAR ALQPAKIDPM TVHADMKTHK ARVMLKPEQI KYAIGKNGNN IHLAERLTGY DVDVYRDVID KSMEDPNDID IIEFREEFGD DMIYQLLDSG LDTAKKVLKA EIEDIEAALV GPPSKSEESA FFTKGRKAPF KPKERTLSED EKRYWKKIAE NIYKTVKEQF NEADLQEIID EEDEDILSDG DADVSVDDQN N
|
| |