Gene SeD_A1139 details


Gene Information

Locus tag: SeD_A1139
Symbol:
ID: 6871376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 1133664
End bp: 1135304
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 51%
IMG OID: 642784323
Product: paraquat-inducible protein B
Protein accession: YP_002214997
Protein GI: 198243917
COG category: [R] General function prediction only
COG ID: [COG3008] Paraquat-inducible protein B
TIGRFAM ID:
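
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1133664, 1135304)
print(length)                  # 1641
print(protein_length(length))  # 546
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed for this record.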


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0102675
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 79
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAACCTA AAAAAGGGGA AGCCAAAGTA CAAAAGGTGA AAAATTGGTC GCCTGTCTGG 
ATATTCCCGA TTGTTACTGC GCTTATCGGA GCCTGGATTC TGTTTTATCA CTACAGCCAC
CAGGGACCGG AAGTTACCTT AATCACCACC AATGCGGAAG GCATTGAAGG CGGAAAAACG
ACGATCAAAA GTCGCAGCGT GGATGTTGGC GTGGTTGAAA GCGCGACGCT GACTGACGAT
CTGACCCACG TACAAATCAA AGCGCGGCTC CATTCCGGTA TGGAAAAGTT GCTGCATAAA
GACTCGGTAT TTTGGGTGGT AAAACCGCAG GTGGGGCGTG AAGGCATCAG CGGGCTTGGG
ACGCTGCTAT CAGGTGCGTA TATTGAACTA CAACCAGGAA GTAAGGGCAG TCAGCCGGAA
AGCTATCAGC TTCTTGACTC ACCGCCGTTG GCGCCGCCCG ATGCCAAAGG TATTCGTGTG
ATTCTGGACA GCAAAAAGGC CGGTCAGCTC AGTCCTGGCG ATCCCGTTCT GTTCCGGGGC
TATCGGGTAG GGTCCGTTGA AACCAGCTCT TTCGATCCGC AAAAACGGAC AATGAGTTAT
CAGTTGTTCA TTAAGGCGCC AAACGATCGG TTGGTCACCA GTAATGTCCG ATTCTGGAAA
GATAGCGGTA TCGCTGTGGA TCTGACATCG GCGGGAATGC GCGTGGAAAT GGGATCGTTG
ACGACGTTGT TTGGCGGCGG CGTGAGTTTT GATGTGCCGG AAGGACTTGA GCAGGGGCAA
CCCGTCGCCG AAAAAACGGC GTTTAATCTC TACGACGATC AAAAAAGTAT TCAGGATTCG
CTGTATACCG ATCATATCGA TTACCTGATG TTCTTTAAAG ATTCGGTGCG CGGATTACAA
CCCGGCGCGC CGCTGGAGTT CCGCGGTATT CGTCTGGGGA CGGTAAGCAA AGTGCCTTTC
TTTGCGTCTA AAATGCGCCA GGTATTTAAC GACGATTACC GTATTCCTGT GCTGGTGCGC
ATTGAACCGG AGCGTCTGAA AGCGCAATTG GGAGAAAATG CGGATGTTGG CGCGCATTTG
ACGGAACTGC TTAAGCGCGG TTTACGCGCT TCGCTTAAAA CCGGTAACCT GGTGACCGGG
GCGCTGTATG TCGATCTGGA CTTTTATCCT AAGGAGCCGC CGATTACCGG GCTACGCGAA
TTTGATGGTT ATGAAATTAT TCCCACCGTC AGCAGCGGCC TGGCGCAAAT TCAACAGCGA
CTGGTGGAAA CGTTGGATAA GATCAACAAC CTGCCGCTGA ATCCGATGAT TGAACAAGCG
ACCAATACGC TGTCTGAAAG CCAGCGTACT ATGCGTCGGC TGCAAACCAC GCTGGATAAT
ATGAACAAGA TTACCTCCAG TCAGTCGATG CAGCAGCTTC CGGCGGATAT GCAAACGACG
TTACGCGAAC TTAACCGCAG TATGCAGGGC TTCCAGCCTG GATCGGCGGC GTATAACAAA
ATGGTGGCGG ATATGCAGCG TCTCGATCAG GTGCTTCGTG AGTTACAACC GGTGTTGAAA
ACTCTGAACG AGAAGAGCAA CGCGCTGGTA TTTGAAGCGA AGGATAAAAA AGATCCTGAG
CCTAAGAGGG CGAAACAATG A
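
A GC content figure like the one reported for this gene can be computed directly from a sequence printed in blocks of ten bases. The sketch below is one way to do that, assuming the value is simply the fraction of G and C bases over the whole sequence; `gc_fraction` is a hypothetical helper, and the example input is only the first row of the gene sequence, so its result will differ from the whole-gene figure.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases, ignoring spaces and line breaks in the input."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence, used here only as sample input.
first_row = "ATGGAACCTA AAAAAGGGGA AGCCAAAGTA CAAAAGGTGA AAAATTGGTC GCCTGTCTGG"
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full gene sequence, with its spaces and line breaks left in place, would give the whole-gene value to compare against the GC content field.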
 
Protein sequence
MEPKKGEAKV QKVKNWSPVW IFPIVTALIG AWILFYHYSH QGPEVTLITT NAEGIEGGKT 
TIKSRSVDVG VVESATLTDD LTHVQIKARL HSGMEKLLHK DSVFWVVKPQ VGREGISGLG
TLLSGAYIEL QPGSKGSQPE SYQLLDSPPL APPDAKGIRV ILDSKKAGQL SPGDPVLFRG
YRVGSVETSS FDPQKRTMSY QLFIKAPNDR LVTSNVRFWK DSGIAVDLTS AGMRVEMGSL
TTLFGGGVSF DVPEGLEQGQ PVAEKTAFNL YDDQKSIQDS LYTDHIDYLM FFKDSVRGLQ
PGAPLEFRGI RLGTVSKVPF FASKMRQVFN DDYRIPVLVR IEPERLKAQL GENADVGAHL
TELLKRGLRA SLKTGNLVTG ALYVDLDFYP KEPPITGLRE FDGYEIIPTV SSGLAQIQQR
LVETLDKINN LPLNPMIEQA TNTLSESQRT MRRLQTTLDN MNKITSSQSM QQLPADMQTT
LRELNRSMQG FQPGSAAYNK MVADMQRLDQ VLRELQPVLK TLNEKSNALV FEAKDKKDPE
PKRAKQ