Gene SeHA_C4220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4220 
Symbol 
ID6490131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4107399 
End bp4108826 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content52% 
IMG OID642744314 
Productinner membrane transport protein YieO 
Protein accessionYP_002047918 
Protein GI194449681 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0177316 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGCA AAAAAGCGCG CAGTATGGCT GGCCTGCCGT GGATCGCGGC GATGGCCTTC 
TTTATGCAGG CGCTAGACGC CACCATTTTG AATACCGCTT TACCGGCTAT TGCGCATAGT
CTTAACCGTT CACCACTCGC CATGCAGTCC GCTATTATCA GTTATACCCT GACAGTAGCT
ATGTTAATTC CGGTAAGCGG CTGGCTGGCT GACCGCTTTG GTACGCGTCG CGTTTTTATG
GTTGCCGTAA GTCTGTTTAC GTTAGGCTCG CTGGCCTGCG CGCTCTCCAG TTCACTATCT
GAATTGGTTA TTTTTCGCGT CGTACAAGGC GTCGGCGGAG CGATGATGAT GCCAGTGGCG
CGGCTGGCGT TATTGCGAGC CTATCCGCGT AGTGAGCTGC TGCCCGTTCT TAACTTTGTC
ACTATGCCGG GGCTGGTAGG TCCGATTCTG GGGCCGGTAT TAGGCGGCGT ACTGGTGACC
TGGGCAAGCT GGCACTGGAT CTTCCTGATT AATATTCCCA TTGGTGTTGC AGGCATTCTG
TATGCCCGCA AATATATGCC CAACTTCACC ACGCCGCGTC GCAAGTTTGA CATGACCGGC
TTCTTTCTTT TTGGGTTAAG TCTGGTTTTA TTTTCCAGCG GAATGGAACT GTTTGGCGAA
AAGATTGTGG CGACATGGAT CGCATCCGCC ATTATTTTTT GCAGTATCGT TCTACTATTG
GCCTATATCC GCCACGCCCG CCGTCATCCG ACACCGTTAA TATCACTATC GCTGTTTAAG
ACACGCACAT TTTCCGTCGG TATTGCCGGC AACCTCGCCA CGCGTCTGGG GACAGGCTGC
GTACCTTTTT TGATGCCGTT AATGCTACAG GTTGGTTTTG GCTACCCGGC TCTGATCGCC
GGCTGCATGA TGGCGCCGAC GGCTTTGGGT TCTATTATCG CGAAATCGAC CGTAACGCAG
GTTTTACGAC GTTTGGGATA CCGGAAGACT CTGGTCGGCA TTACGGTATT TATCGGTCTG
ATGATTGCCC AGTTTTCGTT TCAATCTCCC GCCATGCCGA TCTGGATGCT CGTGCTGCCG
CTATTCATTC TTGGAATGGC GATGTCCACT CAATTTACGG CAATGAATAC GATTACTCTT
GCGGATCTGA CGGACGATAA TGCCAGCAGC GGCAACAGCG TTCTGGCGGT TACGCAGCAA
TTGTCTATCA GTTTAGGCGT GGCGATCAGC GCGGCGGTGT TACGTATTTA TGAAGGTTTT
GCGGGCACCA GTACCGTGGA ACAGTTCCAC TGTACCTTTA TCACGATGGG GGCGATCACT
ATCGTGTCCG CGCTGATGTT TATGCTGCTA AGAGCCAAAG ACGGCAACAA TCTGATTAAA
GAACGGCATA AATCTAAACC GACCCACGCA CCGTCAAAAC CGGAGTAA
 
Protein sequence
MTSKKARSMA GLPWIAAMAF FMQALDATIL NTALPAIAHS LNRSPLAMQS AIISYTLTVA 
MLIPVSGWLA DRFGTRRVFM VAVSLFTLGS LACALSSSLS ELVIFRVVQG VGGAMMMPVA
RLALLRAYPR SELLPVLNFV TMPGLVGPIL GPVLGGVLVT WASWHWIFLI NIPIGVAGIL
YARKYMPNFT TPRRKFDMTG FFLFGLSLVL FSSGMELFGE KIVATWIASA IIFCSIVLLL
AYIRHARRHP TPLISLSLFK TRTFSVGIAG NLATRLGTGC VPFLMPLMLQ VGFGYPALIA
GCMMAPTALG SIIAKSTVTQ VLRRLGYRKT LVGITVFIGL MIAQFSFQSP AMPIWMLVLP
LFILGMAMST QFTAMNTITL ADLTDDNASS GNSVLAVTQQ LSISLGVAIS AAVLRIYEGF
AGTSTVEQFH CTFITMGAIT IVSALMFMLL RAKDGNNLIK ERHKSKPTHA PSKPE