Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TRQ2_1047 |
Symbol | nusA |
ID | 6092479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermotoga sp. RQ2 |
Kingdom | Bacteria |
Replicon accession | NC_010483 |
Strand | + |
Start bp | 1091581 |
End bp | 1092615 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642488242 |
Product | transcription elongation factor NusA |
Protein accession | YP_001739077 |
Protein GI | 170288839 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000699393 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATAG GCTTGCTGGA AGCCCTCGAT CAGCTGGAGG AAGAAAAAGG AATTTCCAAA GAAGAGGTCA TCCCTATCCT CGAAAAAGCG CTGGTGAGCG CCTACAGGAA GAACTTTGGA AATTCCAAGA ACGTGGAGGT TGTTATAGAT AGGAACACAG GAAACATAAG GGTATATCAA CTCCTCGAGG TTGTGGAAGA AGTGGAAGAT CCAGCGACAC AGATATCTCT CGAGGAGGCG AAAAAGATCG ATCCCCTCGC GGAAGTTGGG TCTATTGTGA AGAAGGAGCT AAACGTTAAG AATTTTGGAA GAATAGCCGC GCAGACAGCA AAGCAGGTTC TCATCCAGAG AATCAGAGAA CTCGAGAAGG AAAAACAGTT CGAGAAGTAT TCCGAGCTCA AAGGAACGGT TACAACCGCT GAAGTCATAA GAGTCACAAG CGAGTGGGCG GATATCAGAA TAGGAAAACT CGAGACAAGG CTTCCAAAGA AAGAGTGGAT CCCCGGTGAG GAAATCAAAG CCGGTGATCT GGTGAAGGTC TACATCATCG ATGTGGTTAA AACAACCAAG GGGCCGAAGA TACTCGTGAG CAGGAGAGTA CCGGAGTTCG TAATTGGCCT GATGAAACTC GAAATTCCGG AAGTGGAGAA TGGAATCGTG GAAATAAAGG CTATCGCCAG AGAACCCGGT GTTCGAACAA AGGTGGCAGT TGCATCGAAC GATCCGAACG TGGATCCCAT AGGTGCCTGC ATCGGTGAAG GAGGATCAAG GATAGCCGCC ATACTGAAGG AGCTCAAGGG TGAAAAACTC GACGTTCTGA AGTGGTCGGA CGATCCTAAA CAGCTCATAG CGAACGCCCT CGCGCCGGCT ACCGTCATAG AAGTGGAGAT ACTCGACAAA GAGAACAAGG CCGCACGCGT TCTAGTTCCT CCGACACAGC TTTCCCTCGC CATAGGAAAA GGAGGGCAGA ACGCGAGACT CGCTGCAAAG CTCACAGGAT GGAAAATAGA CATAAAACCG ATCATGAACC TGTGA
|
Protein sequence | MNIGLLEALD QLEEEKGISK EEVIPILEKA LVSAYRKNFG NSKNVEVVID RNTGNIRVYQ LLEVVEEVED PATQISLEEA KKIDPLAEVG SIVKKELNVK NFGRIAAQTA KQVLIQRIRE LEKEKQFEKY SELKGTVTTA EVIRVTSEWA DIRIGKLETR LPKKEWIPGE EIKAGDLVKV YIIDVVKTTK GPKILVSRRV PEFVIGLMKL EIPEVENGIV EIKAIAREPG VRTKVAVASN DPNVDPIGAC IGEGGSRIAA ILKELKGEKL DVLKWSDDPK QLIANALAPA TVIEVEILDK ENKAARVLVP PTQLSLAIGK GGQNARLAAK LTGWKIDIKP IMNL
|
| |