Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A0173 |
Symbol | nusA |
ID | 5136621 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | + |
Start bp | 174612 |
End bp | 176099 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 640531633 |
Product | transcription elongation factor NusA |
Protein accession | YP_001216136 |
Protein GI | 147674353 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA [TIGR01954] transcription termination factor NusA, C-terminal duplication |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000000303519 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAG AAATTTTAGC GGTAGTTGAG GCGGTTTCTA ACGAGAAAGC GGTACCTCGT GAGCGTATTT TTGAAGCGCT GGAAACTGCG TTAGCGACTT CAACCAAAAA GAAGTATGAA ATCGAGATCG ATGTGCGCGT AGCCATCGAT CGCAAAACAG GTGCGTTTGA AACGTTCCGC CGTTGGTTAG TGGTTGAGAA TGTTGAGCAC CCAACCAAAG AGATCTCATT TGAAGCGGCG AGCTTTGATG ATGACTCGGT TCAACTGGGT GACTATATCG AAGATCAGAT CGAATCAGTA ACCTTTGACC GTATTACCAC GCAAACTGCG AAACAAGTGA TCGTACAAAA AGTGCGTGAA GCAGAACGTG CACAGATTGT TGAACAATTC ATCGATAACG AAGGTGAATT GGTGACTGGT GTAGTGAAGA AAGTGAACCG TGAAACCGTG ATCCTCGATC TGGGTAACAA CGCTGAAGCG GTTATCCTAC GTGAAGATCA ACTGCCACGC GAAAACTTCC GTCCTGGCGA TCGTGTTCGT GGTTTGCTGT ACAAAGTGGC TCCAGAAGCA CGTGGTTTCC AACTCTTCAT CACTCGTTCT AAGCCAGAAA TGTTGGCAGA ACTGTTCCGT GTAGAAGTTC CAGAAATTGG TGAAGAGCTG ATTGAGCTGA AAGCGGCTGC GCGTGATCCT GGTTCACGTG CCAAGATCGC GGTAAAAACC AATGATAAGC GTATCGACCC TGTTGGTGCG TGTGTGGGTA TGCGTGGTGC ACGTGTACAA GCGGTTTCTG GCGAACTGGG TGGCGAGCGT ATTGATATCG TTTTGTGGGA TGATAACCCA GCGCAATTCG TCATTAATGC CATGGCACCA GCCGATGTGG CATCGATCAT TATGGATGAA GATGCACACT CAATGGATAT CGCTGTTGAG GCTGATAACC TTGCGCAAGC GATCGGTCGT AACGGTCAAA ACGTACGTTT GGCTTCTCAA CTGACCGGTT GGGAACTGAA TGTGATGACG GTTGCGGATC TACAGAAGAA ACACCAAGAA GAGTCAATGG CATCCATCGA GAACTTCATG AAGTATCTCG ATATTGAGCA AGATTTTGCT GAGCTGCTGG TTGAAGAAGG CTTCTCTACG CTAGAAGAGA TCGCTTACGT ACCAATGAAT GAGCTGCTTG ATGTTGATGG CATGGATGAA GATCTGGCAG ATGAGCTACG TAGTCGTGCG AAAGAAGCGC TGACGACGAT TGCTTTGGCA AAAGAAGAGT CGTTCGAAGG CCTTGAGCCT GCAGAAGATC TGCTTGCATT AGCAGGACTG GAACGCGACA TGGCATTCAA ACTGGCAGCG AAAGGTGTTG CGACACTGGA AGATCTGGCT GACCAAGGTG TGGATGATTT AGAAGGTATT GAAGGTCTGA CCGAACAGCG TGCGGGTGAG CTTATTATGG CGGCTCGTAA CATTTGTTGG TTTGGTGAAG ACGCATAA
|
Protein sequence | MSKEILAVVE AVSNEKAVPR ERIFEALETA LATSTKKKYE IEIDVRVAID RKTGAFETFR RWLVVENVEH PTKEISFEAA SFDDDSVQLG DYIEDQIESV TFDRITTQTA KQVIVQKVRE AERAQIVEQF IDNEGELVTG VVKKVNRETV ILDLGNNAEA VILREDQLPR ENFRPGDRVR GLLYKVAPEA RGFQLFITRS KPEMLAELFR VEVPEIGEEL IELKAAARDP GSRAKIAVKT NDKRIDPVGA CVGMRGARVQ AVSGELGGER IDIVLWDDNP AQFVINAMAP ADVASIIMDE DAHSMDIAVE ADNLAQAIGR NGQNVRLASQ LTGWELNVMT VADLQKKHQE ESMASIENFM KYLDIEQDFA ELLVEEGFST LEEIAYVPMN ELLDVDGMDE DLADELRSRA KEALTTIALA KEESFEGLEP AEDLLALAGL ERDMAFKLAA KGVATLEDLA DQGVDDLEGI EGLTEQRAGE LIMAARNICW FGEDA
|
| |