Gene SeD_A3021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3021 
Symbol 
ID6874580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp2921316 
End bp2922506 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content54% 
IMG OID642786054 
Producttype I secretion membrane fusion protein, HlyD family 
Protein accessionYP_002216700 
Protein GI198245008 
COG category[V] Defense mechanisms 
COG ID[COG1566] Multidrug resistance efflux pump 
TIGRFAM ID[TIGR01843] type I secretion membrane fusion protein, HlyD family 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.0405011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATCA ATCAGCATGA TGCCGCGATG GACGATCCCG ATATTCAGCG TGAACGGGCG 
TTTTCCGGCG CGGGTCGTAT TGTTCTGATC TGCTCACTGT TATTTCTCAT TCTCGGCATC
TGGGCGTGGT TTGGCCGACT GGATGAGGTT TCCACCGGCA ACGGGAAAGT GATCCCCAGT
TCACGCGAAC AGGTTCTGCA GTCGCTGGAT GGCGGCATTC TGGCGCAGTT GACGGTGCGG
GAAGGCGACA GAGTTCAGGC TAACCAGATT GTCGCCCGGC TTGATCCGAC GCGTCTGGCG
TCCAATGTGG GTGAAAGTGC GGCAAAATAT CGCGCTTCAC TCGCCTCCAG CGCACGGTTA
ACCGCGGAAG TCAACGACTT ACCTCTCGCC TTCCCCGCTG AGCTGAACGG CTGGCCGGAT
CTGATTGCCG CAGAGACGCG TCTCTATAAA AGCCGCCGCG CGCAGCTGGC CGATACCGAA
GCCGAGCTAC GGGATGCGCT GGCGTCGGTT AATAAAGAGC TGGCCATTAC CCAGCGTCTG
GAGAAAAGCG GCGCGGCCAG TCATGTTGAA GTGCTGCGCC TGCAACGACA AAAAAGCGAT
TTAGGCTTAA AAATTACCGA TCTGCGCTCA CAATATTATG TGCAGGCACG CGAAGCGTTA
TCAAAAGCGA ACGCTGAGGT CGATATGCTC TCCGCCATTT TAAAAGGACG CGAGGATTCC
GTCACCCGCC TTACCATACG TTCGCCGGTA CGCGGCATTG TTAAAAATAT CCAGGTCACG
ACGATTGGCG GCGTGATCCC GCCTAACGGT GAGATGATGG AGATAGTGCC GGTAGACGAT
CGTCTGTTGA TTGAAACCCG CCTTTCGCCG CGTGATATCG CCTTTATTCA TCCCGGCCAA
CGCGCATTGG TTAAAATTAC TGCTTACGAT TACGCCATTT ACGGCGGGCT TGACGGCGTG
GTGGAGACCA TTTCACCGGA TACCATTCAG GATAAAGTGA AACCGGAAAT TTTCTACTAT
CGCGTGTTTA TCCGCACCCA CCAGGACTAT CTACAAAATA AATCAGGACG CCGTTTTTCG
ATTGTTCCAG GCATGATCGC CACGGTGGAT ATCAAAACCG GTGAAAAAAC CATTGTCGAC
TATTTAATCA AACCGTTTAA TCGCGCGAAA GAAGCGCTGC GCGAGCGGTA A
 
Protein sequence
MKINQHDAAM DDPDIQRERA FSGAGRIVLI CSLLFLILGI WAWFGRLDEV STGNGKVIPS 
SREQVLQSLD GGILAQLTVR EGDRVQANQI VARLDPTRLA SNVGESAAKY RASLASSARL
TAEVNDLPLA FPAELNGWPD LIAAETRLYK SRRAQLADTE AELRDALASV NKELAITQRL
EKSGAASHVE VLRLQRQKSD LGLKITDLRS QYYVQAREAL SKANAEVDML SAILKGREDS
VTRLTIRSPV RGIVKNIQVT TIGGVIPPNG EMMEIVPVDD RLLIETRLSP RDIAFIHPGQ
RALVKITAYD YAIYGGLDGV VETISPDTIQ DKVKPEIFYY RVFIRTHQDY LQNKSGRRFS
IVPGMIATVD IKTGEKTIVD YLIKPFNRAK EALRER