Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | VC0395_A2149 |
Symbol | rpoA |
ID | 5136172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Vibrio cholerae O395 |
Kingdom | Bacteria |
Replicon accession | NC_009457 |
Strand | - |
Start bp | 2305405 |
End bp | 2306397 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640533605 |
Product | DNA-directed RNA polymerase subunit alpha |
Protein accession | YP_001218065 |
Protein GI | 147674278 |
COG category | [K] Transcription |
COG ID | [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit |
TIGRFAM ID | [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00000019407 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGGTT CTGTAACAGA ATTTCTTAAG CCACGTCTTG TTGATATCGA ACAAATCAGC ACGACACACG CAAAAGTAAC TCTTGAGCCG TTAGAGCGTG GTTTCGGCCA TACTCTGGGT AATGCACTTC GCCGTATTCT TCTATCTTCA ATGCCAGGTT GTGCTGTGAC TGAAGTAGAG ATTGAAGGCG TTCTTCACGA GTACAGCACC AAAGAAGGTG TTCAGGAAGA TATCCTTGAG ATTCTCTTGA ACCTGAAAGG TCTGGCTGTT CGCGTTGCCG AAGGCAAAGA TGAAGTGTTC ATTACACTGA ACAAATCAGG CTCGGGCCCT GTGGTTGCAG GTGACATCAC CCATGACGGT GATGTAGAGA TCGTAAACCC TGAACACGTT ATTTGTCATT TAACTTCTGA CAATGCTGCG ATCGCTATGC GTATCAAAGT AGAACGTGGT CGTGGTTATG TTCCAGCTTC TGCCCGTATC CATACTGAAG AAGATGAGCG TCCAATTGGT CGTTTGCTTG TTGACGCGAC TTTCAGCCCA GTAGACAAAA TTGCCTACTC TGTTGAAGCA GCTCGTGTTG AACAGCGTAC TGACTTGGAC AAGCTTGTTA TCGATATGGA AACTAACGGT ACTCTTGAGC CTGAGGAAGC AATCCGTCGC GCAGCAACAA TTCTTGCTGA GCAATTGGAT GCGTTCGTAG ATCTTCGTGA TGTACGTGTA CCTGAGGAGA AGGAAGAGAA GCCAGAATTC GATCCGATCC TACTGCGTCC TGTAGACGAT CTTGAACTAA CAGTTCGCTC TGCTAACTGT CTGAAAGCAG AAGCGATTCA CTACATCGGT GATCTGGTAC AGCGCACTGA GGTTGAGCTT CTTAAAACGC CAAACCTCGG TAAGAAGTCT CTTACAGAGA TTAAAGACGT GCTTGCATCA CGTGGTCTGT CTCTGGGCAT GCGTCTAGAA AACTGGCCAC CAGCGTCAAT CGCTGAAGAT TAA
|
Protein sequence | MQGSVTEFLK PRLVDIEQIS TTHAKVTLEP LERGFGHTLG NALRRILLSS MPGCAVTEVE IEGVLHEYST KEGVQEDILE ILLNLKGLAV RVAEGKDEVF ITLNKSGSGP VVAGDITHDG DVEIVNPEHV ICHLTSDNAA IAMRIKVERG RGYVPASARI HTEEDERPIG RLLVDATFSP VDKIAYSVEA ARVEQRTDLD KLVIDMETNG TLEPEEAIRR AATILAEQLD AFVDLRDVRV PEEKEEKPEF DPILLRPVDD LELTVRSANC LKAEAIHYIG DLVQRTEVEL LKTPNLGKKS LTEIKDVLAS RGLSLGMRLE NWPPASIAED
|
| |