Gene VC0395_A0044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVC0395_A0044 
SymbolrpoD 
ID5136802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVibrio cholerae O395 
KingdomBacteria 
Replicon accessionNC_009457 
Strand
Start bp41202 
End bp43067 
Gene Length1866 bp 
Protein Length621 aa 
Translation table11 
GC content47% 
IMG OID640531504 
ProductRNA polymerase sigma factor RpoD 
Protein accessionYP_001216017 
Protein GI147674012 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00416651 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCAAA ATCCGCAGTC ACAGCTTAAA CAACTTGTCC TTCGCGGCAA GGAACAGGGC 
TATCTGACCT ACGCCGAAGT AAATGACCAC TTGCCTGCTG AAATCGTTGA TTCAGAGCAG
GTGGAAGATA TTATTCAGAT GATTAATGAC ATGGGCATTA AGGTGGTGGA AACCGCACCT
GATGCCGATG ATCTTGCCCT CAGCGATGAT ACTACCATCA CTGACGAAGA TGCTGCTGAA
GCAGCCGCCG CGGCACTTTC TAGCGTAGAG AGTGAGATTG GCCGCACCAC CGATCCAGTA
CGTATGTATA TGCGTGAAAT GGGTACGGTT GAGCTTTTGA CACGTGAAGG TGAAATCGAT
ATTGCCAAGC GCATTGAAGA TGGTATTAAC CAAGTTCAAA GTGCGATTGC TGAGTATCCT
GGAACCATCC CTTATATTCT TGAGCAGTTT GATCGTGTTC AGGCCGAAGA GCTACGTCTC
ACTGACCTGA TTTCAGGTTT CGTTGACCCT AACGACATGG AAACCGAAGC GCCAACCGCG
ACTCACATCG GTTCTGAGCT TTCTGAAGCG GATCTCGCGG ATGAAGATGA TGCTGTCGTC
GAAGATGAAG ACGAAGATGG CGACGGTGAA AGCAGCGACA GCGAAGAAGA AGTCGGTATC
GACCCTGAAC TGGCTCGTGA GAAATTCAAT GAACTGCGCG GTAAGTTCCA AAACCTGCAA
TTAGCGGTTA ATGAATTTGG TCGTGACAGT CATCAAGCTT CTGAAGCGTC AGACTTAGTG
CTGGATATCT TCCGTGAATT CCGCCTAACA CCAAAGCAAT TCGACCACTT GGTTGAAACT
CTGCGCACTT CAATGGATCG TGTTCGCACC CAAGAACGTT TGGTAATGAA AGCGGTAGTT
GAAGTCGCGA AGATGCCGAA GAAATCGTTC ATCGCCCTAT TTACAGGCAA TGAATCGAAT
GAAGAGTGGC TGGATAAAGT CCTTGCTTCT GACAAGCCTT ACGTAGCGAA AGTCCGTGAG
CAAGAAGAAG AGATCCGCCG TTCAATTCAG AAACTACAAA TGATCGAGCA AGAGACATCA
CTGTCTGTTG AACGCATCAA AGACATCAGC CATCGTATGT CAATCGGTGA GGCGAAAGCT
CGCCGTGCGA AGAAAGAGAT GGTTGAAGCA AACTTACGTC TGGTAATTTC GATTGCTAAG
AAATACACCA ACCGTGGTCT GCAATTCTTG GATCTGATCC AAGAAGGTAA CATCGGTTTG
ATGAAAGCCG TCGATAAGTT TGAATACCGT CGTGGTTATA AATTCTCAAC TTATGCGACT
TGGTGGATCC GTCAGGCAAT CACTCGTTCT ATTGCTGACC AAGCACGTAC GATTCGTATT
CCAGTACACA TGATCGAGAC GATCAATAAA CTGAATCGAA TTTCGCGCCA AATGCTGCAA
GAGATGGGTC GTGAGCCACT GCCTGAAGAA TTGGCAGAAC GCATGCAAAT GCCAGAAGAC
AAAATCCGCA AAGTGCTGAA AATTGCCAAA GAACCCATCT CCATGGAAAC ACCGATTGGT
GATGATGAAG ATTCGCATCT GGGTGATTTT ATCGAGGATA CCACCCTCGA GCTGCCACTG
GATTCTGCGA CCGCAACCAG CCTTAAAGCT GCCACTCGCG ATGTATTAGC AGGCCTAACA
CCTCGTGAAG CGAAAGTGCT ACGTATGCGT TTCGGTATCG ATATGAATAC CGACCATACT
CTGGAAGAAG TGGGCAAGCA GTTTGATGTA ACTCGTGAAC GTATTCGTCA GATTGAAGCA
AAAGCACTGC GTAAACTGCG TCATCCAAGC CGCTCAGAAG TTCTGCGCAG CTTCTTGGAT
GAATAA
 
Protein sequence
MDQNPQSQLK QLVLRGKEQG YLTYAEVNDH LPAEIVDSEQ VEDIIQMIND MGIKVVETAP 
DADDLALSDD TTITDEDAAE AAAAALSSVE SEIGRTTDPV RMYMREMGTV ELLTREGEID
IAKRIEDGIN QVQSAIAEYP GTIPYILEQF DRVQAEELRL TDLISGFVDP NDMETEAPTA
THIGSELSEA DLADEDDAVV EDEDEDGDGE SSDSEEEVGI DPELAREKFN ELRGKFQNLQ
LAVNEFGRDS HQASEASDLV LDIFREFRLT PKQFDHLVET LRTSMDRVRT QERLVMKAVV
EVAKMPKKSF IALFTGNESN EEWLDKVLAS DKPYVAKVRE QEEEIRRSIQ KLQMIEQETS
LSVERIKDIS HRMSIGEAKA RRAKKEMVEA NLRLVISIAK KYTNRGLQFL DLIQEGNIGL
MKAVDKFEYR RGYKFSTYAT WWIRQAITRS IADQARTIRI PVHMIETINK LNRISRQMLQ
EMGREPLPEE LAERMQMPED KIRKVLKIAK EPISMETPIG DDEDSHLGDF IEDTTLELPL
DSATATSLKA ATRDVLAGLT PREAKVLRMR FGIDMNTDHT LEEVGKQFDV TRERIRQIEA
KALRKLRHPS RSEVLRSFLD E