Gene SeHA_C3689 details


Gene Information

Locus tag: SeHA_C3689
Symbol:
ID: 6489232
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3575525
End bp: 3576682
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 55%
IMG OID: 642743807
Product: acriflavine resistance protein E
Protein accession: YP_002047419
Protein GI: 194450100
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0845] Membrane-fusion protein
TIGRFAM ID: [TIGR01730] RND family efflux transporter, MFP subunit
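
The gene and protein lengths listed above follow from the Start bp and End bp values. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, but one that matches the stated 1158 bp) and that the CDS ends in a single stop codon. The function names are illustrative and not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 3575525
END_BP = 3576682


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1158
print(protein_length(length_bp))  # 385
```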


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value: 0.10776
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAAAC ATGCCAGGTT TTCACTCCTG CCCTCATTCA TCATATTCTC TGCTGCGCTG 
CTGGCCGGTT GTAATGACCA GGGAGATACC CAGGCTCATG CCGGCGAGCC GCAAGTCACC
GTCCATGTGG TCGAAACAGC GCCGCTAGCC GTAACGACCG AACTTCCCGG ACGTACGTCC
GCATTTCGCA TTGCGGAGGT TCGCCCCCAG GTGAGCGGGA TCGTGCTTAA AAGAAACTTC
ACCGAAGGTA GCGATGTAGA GGCCGGACAG TCGCTTTATC AGATCGATCC TGCCACTTAT
CAGGCCGATT ATGACAGCGC TAAAGGCGAA CTTGCTAAAA GCGAAGCGGC TGCGGCTATC
GCGCACCTGA CGGTCAAACG CTATGTTCCA CTGGTCGGCA CAAAATATAT CAGCCAACAG
GAATATGATC AGGCGATTGC CGACGCCCGC CAGGCCGATG CCGCCGTTGT GGCGGCAAAA
GCCGCTGTTG AAAGCGCGCG TATTAACCTT GCGTATACCA AAGTCACCTC ACCCATCAGC
GGGCGTATAG GAAAATCTAA TGTGACTGAA GGCGCGCTGG TGACTAATGG TCAGTCAACT
GAACTGGCTA CCGTGCAACA ACTCGACCCG ATTTATGTCG ACGTGACGCA ATCAAGCAAC
GACTTTATGC GACTCAAGCA ATCCGTCGAA CAAGGTAACC TGCATAAAGA CAGCGCCAGT
AGCACGGTTC AACTGGTAAT GGAAAATGGT CAGGTCTACC CGATTAAAGG CACGCTGCAA
TTTTCCGACG TTACCGTAGA TGAAAGCACC GGCTCTATCA CGCTCAGGGC GGTGTTCCCT
AACCCGCAAC ACAGTCTGCT TCCCGGTATG TTTGTTCGCG CCCGCATTGA TGAAGGCGTC
CAGCCCAATG CCATCCTTGT CCCCCAGCAG GGTGTAACCC GCACGCCGCG CGGCGACGCA
ATGGTGATGG TGGTTAACGA TAAAAGTCAG GTCGAAGCCC GCAATGTCGT GGCAGCGCAG
GCTATTGGCG ATAAATGGCT CATCAGCGAA GGGTTAAAAC CGGGCGATAA GGTCATCGTC
AGCGGCTTAC AAAAAGCGCG ACCGGGCGTC CAGGTGAAAG CCACTACCGA TGCTCCTGCA
GCGAAAACGG CGCAATAA
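
A GC content figure such as the one in the Gene Information table can be recomputed from a sequence laid out in 10-base blocks like the one above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over all bases. The function name `gc_percent` is hypothetical, and how the record rounds its value is not stated.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G/C bases in a sequence; whitespace is ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First 10-base block of the gene sequence above.
print(gc_percent("ATGACGAAAC"))  # 40.0
```

Passing the full gene sequence (all lines joined) gives the value for the whole gene.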
 
Protein sequence
MTKHARFSLL PSFIIFSAAL LAGCNDQGDT QAHAGEPQVT VHVVETAPLA VTTELPGRTS 
AFRIAEVRPQ VSGIVLKRNF TEGSDVEAGQ SLYQIDPATY QADYDSAKGE LAKSEAAAAI
AHLTVKRYVP LVGTKYISQQ EYDQAIADAR QADAAVVAAK AAVESARINL AYTKVTSPIS
GRIGKSNVTE GALVTNGQST ELATVQQLDP IYVDVTQSSN DFMRLKQSVE QGNLHKDSAS
STVQLVMENG QVYPIKGTLQ FSDVTVDEST GSITLRAVFP NPQHSLLPGM FVRARIDEGV
QPNAILVPQQ GVTRTPRGDA MVMVVNDKSQ VEARNVVAAQ AIGDKWLISE GLKPGDKVIV
SGLQKARPGV QVKATTDAPA AKTAQ
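
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a plain-Python illustration, not the tool used to build this record. It assumes the standard codon assignments, which translation table 11 shares with table 1 for elongation. Table 11 also permits alternative start codons, which the sketch does not handle. That does not matter here because this gene starts with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGACGAAAC ATGCCAGGTT TTCACTCCTG"))  # MTKHARFSLL
```

The output matches the first ten residues of the protein sequence above. Translating the final line of the gene sequence, `GCGAAAACGG CGCAATAA`, gives `AKTAQ`, which matches the last five residues.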