Gene VC0395_A2149 details

Gene Information

Locus tag: VC0395_A2149
Symbol: rpoA
ID: 5136172
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Vibrio cholerae O395
Kingdom: Bacteria
Replicon accession: NC_009457
Strand:
Start bp: 2305405
End bp: 2306397
Gene Length: 993 bp
Protein Length: 330 aa
Translation table: 11
GC content: 47%
IMG OID: 640533605
Product: DNA-directed RNA polymerase subunit alpha
Protein accession: YP_001218065
Protein GI: 147674278
COG category: [K] Transcription
COG ID: [COG0202] DNA-directed RNA polymerase, alpha subunit/40 kD subunit
TIGRFAM ID: [TIGR02027] DNA-directed RNA polymerase, alpha subunit, bacterial and chloroplast-type
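
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are inferred from the numbers in this record, not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2305405, 2306397)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths agree with the listed 993 bp and 330 aa.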


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000019407
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGGGTT CTGTAACAGA ATTTCTTAAG CCACGTCTTG TTGATATCGA ACAAATCAGC 
ACGACACACG CAAAAGTAAC TCTTGAGCCG TTAGAGCGTG GTTTCGGCCA TACTCTGGGT
AATGCACTTC GCCGTATTCT TCTATCTTCA ATGCCAGGTT GTGCTGTGAC TGAAGTAGAG
ATTGAAGGCG TTCTTCACGA GTACAGCACC AAAGAAGGTG TTCAGGAAGA TATCCTTGAG
ATTCTCTTGA ACCTGAAAGG TCTGGCTGTT CGCGTTGCCG AAGGCAAAGA TGAAGTGTTC
ATTACACTGA ACAAATCAGG CTCGGGCCCT GTGGTTGCAG GTGACATCAC CCATGACGGT
GATGTAGAGA TCGTAAACCC TGAACACGTT ATTTGTCATT TAACTTCTGA CAATGCTGCG
ATCGCTATGC GTATCAAAGT AGAACGTGGT CGTGGTTATG TTCCAGCTTC TGCCCGTATC
CATACTGAAG AAGATGAGCG TCCAATTGGT CGTTTGCTTG TTGACGCGAC TTTCAGCCCA
GTAGACAAAA TTGCCTACTC TGTTGAAGCA GCTCGTGTTG AACAGCGTAC TGACTTGGAC
AAGCTTGTTA TCGATATGGA AACTAACGGT ACTCTTGAGC CTGAGGAAGC AATCCGTCGC
GCAGCAACAA TTCTTGCTGA GCAATTGGAT GCGTTCGTAG ATCTTCGTGA TGTACGTGTA
CCTGAGGAGA AGGAAGAGAA GCCAGAATTC GATCCGATCC TACTGCGTCC TGTAGACGAT
CTTGAACTAA CAGTTCGCTC TGCTAACTGT CTGAAAGCAG AAGCGATTCA CTACATCGGT
GATCTGGTAC AGCGCACTGA GGTTGAGCTT CTTAAAACGC CAAACCTCGG TAAGAAGTCT
CTTACAGAGA TTAAAGACGT GCTTGCATCA CGTGGTCTGT CTCTGGGCAT GCGTCTAGAA
AACTGGCCAC CAGCGTCAAT CGCTGAAGAT TAA
 
Protein sequence
MQGSVTEFLK PRLVDIEQIS TTHAKVTLEP LERGFGHTLG NALRRILLSS MPGCAVTEVE 
IEGVLHEYST KEGVQEDILE ILLNLKGLAV RVAEGKDEVF ITLNKSGSGP VVAGDITHDG
DVEIVNPEHV ICHLTSDNAA IAMRIKVERG RGYVPASARI HTEEDERPIG RLLVDATFSP
VDKIAYSVEA ARVEQRTDLD KLVIDMETNG TLEPEEAIRR AATILAEQLD AFVDLRDVRV
PEEKEEKPEF DPILLRPVDD LELTVRSANC LKAEAIHYIG DLVQRTEVEL LKTPNLGKKS
LTEIKDVLAS RGLSLGMRLE NWPPASIAED
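
As a rough consistency check, the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal, dependency-free example: it builds a codon table using the amino-acid assignments shared by NCBI translation tables 1 and 11 (the two tables differ in permitted start codons, not in codon meanings) and translates the first 60 bases of the gene sequence shown above. The helper names `translate` and `gc_fraction` are hypothetical and not part of any database tooling.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (NCBI tables 1 and 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record above.
first_60 = "ATGCAGGGTT CTGTAACAGA ATTTCTTAAG CCACGTCTTG TTGATATCGA ACAAATCAGC"
print(translate(first_60))
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full 993-base sequence to `translate` and `gc_fraction` would allow the listed protein length and GC content to be checked in the same way.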