Gene VC0395_A2109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A2109 
SymbolrpoN 
ID5136127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp2265078 
End bp2266541 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content50% 
IMG OID640533565 
ProductRNA polymerase factor sigma-54 
Protein accessionYP_001218025 
Protein GI147673900 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCGT CATTACAGCT CAAGCTAGGT CAACAGTTAG CCATGACTCC GCAGCTCCAG 
CAGGCGATTC GTCTGTTGCA GCTTTCGACG CTGGATCTGC AGCAAGAAAT TCAGGAAGCA
TTGGAATCTA ACCCCTTGTT AGAGGTGGAA GAGAACCAAG ACGAAGCGCC ATCGCTTGAT
AGCGTACGTA TGACGGAAGA AAGCCCGCGC GAGCCAGAAG AACTCTACGA ACCTGAACCG
CAGGATAGCT CGGATCTGAT TGAAAAATCG GAAATCAGCG CTGAATTAGA GATGGATACC
ACTTGGGATG AAGTCTACAG CGCCAATACT GGCAGTACAG GACTGGCACT CGATGATGAT
GCCCCTATCT ACCAAGGCGA AACCACCCAA ACCCTGCAAG ATTATCTTCA CTGGCAACTC
GACTTAACTC CCTTTAGCGA TGTGGATCGC ACCATTGCCG TTGCACTTAT CGATGCTATC
GACGATTACG GTTATTTAAC CGTCTCACTA GAGGAGATCC AAGAGAGCCT GCGCAGTGAT
GACATTGAGT TGGATGAGAT TGAAGCGGTA CGTAAACGCA TTCAGCAGTT TGACCCCTTT
GGTGTGGCAT CGCTTAATCT GCAAGACTGC TTGTTGCTGC AACTCACCAC CTATCCCTGT
GATACTCCTT GGCTGGAAGA AGCGCGTTTA CTGCTCTCAC AGTACATCGA TGATTTAGGG
AATCGCGATT ACAAAACCAT TCTGAAAGAG ACTAAGCTCA AAGAAGAGGA CTTGCGCGAG
ATCCTGCAAC TCATCCAACA GCTCGATCCT CGCCCCGGCA GTCGAATTGC CCAAGATCAC
GCTGAATACG TTATACCTGA CGTGTCAGTC TATAAAGAAC AAGGTCGATG GCTGGTCACC
ATCAACCCAG ATAGTGTGCC TAAACTGAAG ATCAATCAGC AGTATGCCGA CCTGATGCGC
GGTAATAATG CGGAGAGCAA CTACATCCGT ACCAATTTGC AAGAGGCAAA ATGGCTGATC
AAAAGTTTAG AAAGTAGAAA CGAGACACTG CTCAAAGTTG CCAAGTGCAT AGTTGAACAC
CAACACGATT TCTTCGAGTA TGGAGAAGAG GCGATGAAGC CGATGGTGCT CAACGATGTG
GCGATGGCGG TGGAAATGCA TGAATCGACA ATTTCGCGTG TCACCACGCA GAAATACATG
CATACCCCGC GCGGCATTTT TGAGTTGAAG TACTTTTTCT CCAGCCACGT CAGTACTGAC
AATGGCGGAG AGTGCTCGTC CACTGCGATC AGGGCGCTGA TCAAAAAACT GGTGGCGGCA
GAGAATCCTG CCAAACCACT CAGTGACAGC AAGATCGCAA CCCTACTCGC TGACCAAGGC
ATCCAAGTCG CGCGGCGAAC CATCGCCAAA TATCGTGAAT CACTTGGCAT CGCCCCATCA
AGTCAGCGCA AACGCCTGCT ATAG
 
Protein sequence
MKPSLQLKLG QQLAMTPQLQ QAIRLLQLST LDLQQEIQEA LESNPLLEVE ENQDEAPSLD 
SVRMTEESPR EPEELYEPEP QDSSDLIEKS EISAELEMDT TWDEVYSANT GSTGLALDDD
APIYQGETTQ TLQDYLHWQL DLTPFSDVDR TIAVALIDAI DDYGYLTVSL EEIQESLRSD
DIELDEIEAV RKRIQQFDPF GVASLNLQDC LLLQLTTYPC DTPWLEEARL LLSQYIDDLG
NRDYKTILKE TKLKEEDLRE ILQLIQQLDP RPGSRIAQDH AEYVIPDVSV YKEQGRWLVT
INPDSVPKLK INQQYADLMR GNNAESNYIR TNLQEAKWLI KSLESRNETL LKVAKCIVEH
QHDFFEYGEE AMKPMVLNDV AMAVEMHEST ISRVTTQKYM HTPRGIFELK YFFSSHVSTD
NGGECSSTAI RALIKKLVAA ENPAKPLSDS KIATLLADQG IQVARRTIAK YRESLGIAPS
SQRKRLL