Gene SeHA_C0478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0478 
Symbol 
ID6489789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp482732 
End bp483943 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content52% 
IMG OID642740746 
Productmajor facilitator superfamily transporter 
Protein accessionYP_002044413 
Protein GI194450714 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones77 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATCCT GGAAAGTTAA TCTAATTTCC GTTTGGTTTG GATGTTTTTT TACCGGGCTG 
GCAATCAGCC AAATCCTGCC ATTCTTACCC CTTTATATTT CCCAGCTTGG CGTCTCTTCC
CATGAAGCGT TATCAATGTG GTCCGGGTTA ACGTTTAGCA TCACGTTTCT TATTTCCGCC
ATTGTGTCGC CGATGTGGGG CAGTCTCGCC GATCGTAAAG GGCGTAAACT GATGCTATTG
CGCGCGTCGC TCGGGATGGC GATAGCTATT CTACTGCAGG CATTTGCGAC CCATGTCTGG
CAACTTTTCC TGCTGCGCGG AATCATGGGG TTAACGTCAG GCTATATCCC CAATGCTATG
GCGCTGGTAG CCTCTCAGGT ACCACGCGAA CGTAGCGGCT GGGCGCTCAG TACGCTTTCT
ACCGCGCAGA TCAGCGGCGT TATCGGCGGG CCGTTAATGG GCGGCTTTGT CGCGGATCAT
ATCGGGCTGC GGGCGGTATT TCTGATTACC GCCATGCTGT TGGTGGTGAG CTTTCTGGTC
ACGCTATTTT TAATTAAAGA AGGCGTGCGT CCGGTCATCA AGAAAAGCGA ACGCCTGAGC
GGTAAAGCCG TTTTTGCGTC GTTACCTTAT CCTGCGCTGG TGATCAGTTT GTTTTTTACC
ACGATGGTCA TTCAACTCTG TAATGGTTCC ATCAGTCCAA TCCTGGCGCT GTTTATCAAA
TCAATGATGC CGGACAGTAA TAACATCGCC TTTCTTAGCG GGTTAATTGC CTCGGTGCCC
GGTATCTCTG CGCTTATCTC CGCGCCTCGC CTGGGAAAAC TTGGCGACAG AATCGGCACG
GAAAGAATTC TGATGGCCAC GCTTATCTGC GCAGTGCTGC TTTTCTTCGC GATGTCCTGG
GTCACCACGC CGTTCCAGTT GGGGCTGTTA CGTTTCTTGT TAGGCTTTGC CGATGGCGCG
ATGTTACCCG CCGTACAAAC GTTATTGGTG AAATACTCCA GCGATCAAAT TACCGGACGT
ATTTTTGGCT ACAACCAGTC ATTTATGTAC CTGGGCAACG TGGTTGGGCC GTTGATGGGC
GCGACGGTAT CGGCGATGGC CGGTTTCCGC TGGGTTTTTA TCGCTACGGC GGCGATCGTG
TTGATCAATA TCGGGCAACT GACCCTGGCG TTACGTCGTC GGCGTAACGC GCAAAAAGCG
AAAGGCCAAT AG
 
Protein sequence
MESWKVNLIS VWFGCFFTGL AISQILPFLP LYISQLGVSS HEALSMWSGL TFSITFLISA 
IVSPMWGSLA DRKGRKLMLL RASLGMAIAI LLQAFATHVW QLFLLRGIMG LTSGYIPNAM
ALVASQVPRE RSGWALSTLS TAQISGVIGG PLMGGFVADH IGLRAVFLIT AMLLVVSFLV
TLFLIKEGVR PVIKKSERLS GKAVFASLPY PALVISLFFT TMVIQLCNGS ISPILALFIK
SMMPDSNNIA FLSGLIASVP GISALISAPR LGKLGDRIGT ERILMATLIC AVLLFFAMSW
VTTPFQLGLL RFLLGFADGA MLPAVQTLLV KYSSDQITGR IFGYNQSFMY LGNVVGPLMG
ATVSAMAGFR WVFIATAAIV LINIGQLTLA LRRRRNAQKA KGQ