Gene Information |
Locus tag | GWCH70_1152 |
Symbol | nusA |
ID | 7977630 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1201809 |
End bp | 1202957 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644798105 |
Product | transcription elongation factor NusA |
Protein accession | YP_002949278 |
Protein GI | 239826654 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
|
|
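The Start bp, End bp, Gene Length, and Protein Length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names `gene_length` and `protein_length` are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates.
START_BP = 1201809
END_BP = 1202957


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product of an unspliced CDS.

    Assumption: the CDS length includes one terminal stop codon,
    which does not contribute a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the script reproduces the listed Gene Length (1149 bp) and Protein Length (382 aa).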
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000162475 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACAC AATTACTCGA TGCCCTAGCG GATATTATCC GTGAAAAAGG CATTAGCAAA GAAGTTGTGA TGGAAGCGAT TGAAGCGGCG ATCATTTCTG CGTATAAACG CAATTTCGGC CAGGCGCAAA ACGTACGGGT TGATTTAAAT ACAGAAACAG GAACGATCCG TGTATTTGCG CGTAAAGAAG TGGTTGATGA AGTAAATGAT CCGCGCCTCG AAATTTCTTT AGAAGAAGCA CAACGAATCA ATCCGAATTA TCAAATCGGC GATGTTCTCG AGTTAGAAGT AACGCCGAAA GATTTTGGCC GCATTGCGGC ACAAACAGCG AAACAAGTTG TTACACAACG GGTGCGCGAA GCAGAGAGAA GCGTTATTTA TGCTGAGTTT GTTGACCGTG AAGAAGATAT CATGACAGGG ATCGTACAAC GTGTAGATCC TCGTTTTGTT TATGTCAGTC TCGGTAAAAC AGAAGCGTTG CTGCCGGCTA GCGAACAAAT GCCGAACGAA ACATACAAAC CGCACGACCG CATTAAAGTG TATATTACAA AAGTGGAAAA GACAACAAAA GGACCACAAA TTTTTGTTTC ACGCACCCAT CCGGGATTGC TTAAGCGTCT TTTTGAACTG GAAGTCCCAG AAATTTATGA TGGAACAGTA GAAATTAAAT CCATTGCCCG TGAGGCGGGA GACCGCTCGA AAATTTCGGT GCATTCCGAC AATCCGGAAG TGGATCCGGT CGGCGCGTGC GTCGGTCCAA AAGGGCAGCG GGTTCAAGCG GTTGTGGAGG AATTAAACGG GGAAAAAATT GATATCGTCC GCTGGTCTGC AGATCCTGTT GAGTTTGTCG CAAACGCATT AAGTCCGGCA AAAGTGTTGC GTGTAATCGT CAATGAAGAA CAAAAAGCAA CGACCGTAAT CGTGCCGGAT TATCAGCTGT CATTAGCAAT CGGCAAACGC GGGCAAAACG CTCGGCTAGC AGCAAAGCTG ACAAACTGGA AAATTGATAT TAAAAGCGAA TCGGAAGCGA GAGAATTAGG AATCGATCCA TATGCGCAAT CCACTTTTCT TGATTCAGAA GAGACATCGG TCAATAATGA AAATGACAGC AACCAATCAT TCGATTTACA AGAAGAGAAA ATCGAGTGA
|
Protein sequence | MNTQLLDALA DIIREKGISK EVVMEAIEAA IISAYKRNFG QAQNVRVDLN TETGTIRVFA RKEVVDEVND PRLEISLEEA QRINPNYQIG DVLELEVTPK DFGRIAAQTA KQVVTQRVRE AERSVIYAEF VDREEDIMTG IVQRVDPRFV YVSLGKTEAL LPASEQMPNE TYKPHDRIKV YITKVEKTTK GPQIFVSRTH PGLLKRLFEL EVPEIYDGTV EIKSIAREAG DRSKISVHSD NPEVDPVGAC VGPKGQRVQA VVEELNGEKI DIVRWSADPV EFVANALSPA KVLRVIVNEE QKATTVIVPD YQLSLAIGKR GQNARLAAKL TNWKIDIKSE SEARELGIDP YAQSTFLDSE ETSVNNENDS NQSFDLQEEK IE
|
| |
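The gene and protein sequences above can be cross-checked against the GC content and Translation table fields. The following is a minimal sketch using only the Python standard library. It assumes the sequence is pasted in the space-separated 10-base blocks shown above, and it builds the codon mapping from the usual 64-letter amino acid string for the bacterial code (table 11), whose codon assignments match the standard code. The helper names `clean`, `gc_percent`, and `translate` are illustrative and not part of the record.

```python
from itertools import product

# Codon order: T, C, A, G at each position, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE_11 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon.

    Assumption: the sequence starts with ATG, so no special handling of
    alternative start codons is included.
    """
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE_11[s[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and last nine bases of the gene sequence above.
head = "ATGAATACAC AATTACTCGA TGCCCTAGCG"
tail = "ATCGAGTGA"
print(translate(head))  # MNTQLLDALA
print(translate(tail))  # IE*
```

The translated head and tail agree with the first ten and last two residues of the protein sequence, with the final TGA read as the stop codon. Passing the full gene sequence to `gc_percent` should give a value that rounds to the GC content listed in the record.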