Gene Information |
Locus tag | Tgr7_1002 |
Symbol | nusA |
ID | 7316579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1082369 |
End bp | 1083874 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643615889 |
Product | transcription elongation factor NusA |
Protein accession | YP_002513077 |
Protein GI | 220934178 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA; [TIGR01954] transcription termination factor NusA, C-terminal duplication |
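The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon, neither of which is stated in the record itself. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1082369
END_BP = 1083874
GENE_LENGTH_BP = 1506
PROTEIN_LENGTH_AA = 501


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("codon count matches protein length:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the table: the span covers 1506 bp, and 502 codons minus one stop codon give 501 residues.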
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAACAAAG AGATCCTGAT GGTGGTGGAC GCCGTTTCCA ACGAGAAGGG CGTCGAGAAG GAAGTGATCT TTGAGGCCAT CGAGGCCGCG CTCGGCTCCG CCACGCGCAA GAAGCACGGC GGCGAGATCG ACGTGCGCGT CAGCATCCAC CGGGAGACCG GCGACTACGA GACCTTCCGC CGCTGGCTGG TGGTGGACGA CGAGGACCCG GAATTCGAGA GCCCGGAGCG CCAGAAGCTG CTGTCCTACG CGCACCAGAC CCATCCCGAC ATCCAGGTGG GCGAGTACAT CGAGGAGCCC ATGGAATCCG TGGACTTCGG ACGCATCGCC GCCCAGACCG CCAAGCAGGT GATCGTGCAG AAGGTGCGCG AGGCCGAGCG TGCCAAGGTG GTCGAAGAGT TCAGACACCG GGTCGGCGAG CTGGTGATGG GCGTGGTCAA GCGCACCGAG CGCAGCGGCG TGTTCCTGGA CCTGGGCGGC AATGCCGAGG CCTTCATCCC CCGCGAGGAG ATGATCCCCC GGGAGACGGT GCGCCCCAAC GATCGCCTGC GCGGTTACCT CAAGGAGGTG CGTTCCGAGC CCCGCGGCCC GCAGCTGTTC GTCAGCCGCA CGGCGCCGGA ATTCCTCATC GAGATGTTCA AGCTGGAAGT GCCCGAGGTG GGCCAGGGCC TGATCGACAT CATGGGCGCC GCCCGTGACC CGGGTCTTCG CGCCAAGATC GCGGTGCGTT CCAATGATTC GCGCCTGGAT CCCGTGGGCG CCTGCGTGGG CATGCGCGGT TCCCGGGTGC AGTCGGTGTC CAACGAGCTG GCCGGCGAGC GCGTCGACAT CATTCTCTGG GACGATAACC CGGCCCAGTT CGTGATCAAT GCCATGTCGC CCGCCGAGGT GCTGTCCATC GTCATGGACG AGGACCGCCA CAGCATGGAC ATCGCCGTGG CCGAGGAGAA GCTCTCCCAG GCCATCGGGC GGGGCGGGCA GAACATCCGC CTGGCCTCCC AGCTCACCGG TTGGGAACTG AACGTGATGA ACGAGGCCCA GGCCGCCGAG AAGAGCGAGG CCGAGGCCGC CGAGCTGCAG CGCATGTTCA TGGAGAACCT GGACGTGGAC GAGGAGGTCG CCGCCATCCT GGTCCAGGAA GGTTTCTCCA GCATCGAGGA AGTGGCCTAC GTGCCCACCT CGGAGATCAT GCAGATCGAG GAGTTCGACG AGGAGATCGT CGAGGAGCTG CGTGCCCGTG CCCGGGACGC CCTGCTCACC AAGGCCATCG CTACGGAGGA ACAGGGTGGC GGTGAGCCCG CCGAGGACCT GCTCAACATG GAGGGCATGG ACCGGGATCT GGCCTTCAAG CTGGCTGCCA AGGGCATCTG CACCATGGAA GATCTGGCCG AGCAGGCCGT GGACGACCTG GTGGAGATCT CCGGTCTGGA CGCCGAGAAG GCCGGCGAGC TGATCATGAC CGCCCGGGCG CCCTGGTTCG CGGAGGATGC GGGCGAGGCC AAGTAA
Protein sequence | MNKEILMVVD AVSNEKGVEK EVIFEAIEAA LGSATRKKHG GEIDVRVSIH RETGDYETFR RWLVVDDEDP EFESPERQKL LSYAHQTHPD IQVGEYIEEP MESVDFGRIA AQTAKQVIVQ KVREAERAKV VEEFRHRVGE LVMGVVKRTE RSGVFLDLGG NAEAFIPREE MIPRETVRPN DRLRGYLKEV RSEPRGPQLF VSRTAPEFLI EMFKLEVPEV GQGLIDIMGA ARDPGLRAKI AVRSNDSRLD PVGACVGMRG SRVQSVSNEL AGERVDIILW DDNPAQFVIN AMSPAEVLSI VMDEDRHSMD IAVAEEKLSQ AIGRGGQNIR LASQLTGWEL NVMNEAQAAE KSEAEAAELQ RMFMENLDVD EEVAAILVQE GFSSIEEVAY VPTSEIMQIE EFDEEIVEEL RARARDALLT KAIATEEQGG GEPAEDLLNM EGMDRDLAFK LAAKGICTME DLAEQAVDDL VEISGLDAEK AGELIMTARA PWFAEDAGEA K
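The sequence fields are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the GC content and the translation could be recomputed from the gene sequence. It assumes the elongation codon assignments of translation table 11, which match the standard genetic code, and it ignores the table's alternative start codons; that is sufficient here because this gene begins with ATG. All function and variable names are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order (standard code; table 11 uses the same
# assignments and differs only in which codons may act as start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    prefix = "ATGAACAAAG AGATCCTGAT GGTGGTGGAC"
    print(translate(prefix))  # MNKEILMVVD, the first block of the protein
```

Passing the full Gene sequence field to `gc_fraction` and `translate` would allow a comparison with the GC content and Protein sequence fields reported in the record.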