Gene SeHA_C1066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C1066 
Symbol 
ID6488469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp1053999 
End bp1055147 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content52% 
IMG OID642741308 
Productputative MFS family transporter protein 
Protein accessionYP_002044960 
Protein GI194448266 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0738] Fucose permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0109071 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones73 
Fosmid unclonability p-value0.857423 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACCT ATACCCGTCC CGTCATGCTT TTGCTGTGCG GGCTACTTTT GTTGACTCTG 
GCCATTGCGG TACTGAATAC GCTTGTGCCG CTGTGGCTTG CTCAGGCAAA CCTTCCGACC
TGGCAGGTGG GGATGGTCAG CTCGTCTTAT TTTACCGGCA ATCTGGTCGG GACGTTATTT
ACCGGGTATT TAATTAAACG CATTGGGTTT AACCGTAGCT ATTATCTTGC CTCGCTGATC
TTCGCCGCGG GTTGTGTCGG ATTGGGGGTG ATGGTGGGGT TCTGGAGCTG GATGAGCTGG
CGTTTTATTG CCGGTATCGG CTGCGCCATG ATTTGGGTGG TTGTCGAAAG CGCGTTGATG
TGCAGCGGAA CCTCGCATAA TCGCGGGCGC CTGCTGGCTG CCTATATGAT GGTCTATTAC
ATGGGGACCT TCCTTGGACA ATTATTGGTC AGTAAAGTAT CTGGTGAATT GCTGCACGTC
CTTCCCTGGG TGACCGGAAT GATTCTGGCG GGAATTCTGC CGCTACTCTT TACCCGAATT
GTAAATCAGC AAACGCAGAC ACGTCATTCC TCTTCTATTA GCGCCATGCT GAAGCTACGC
CAGGCGCGTC TTGGCGTGAA TGGTTGCATT ATTTCCGGCA TTGTTCTTGG TTCATTATAT
GGCCTGATGC CGTTATATCT GAAGCATCAG GGGATGGCTA ACGCCAGCAT CGGTTTCTGG
ATGGCGGTGC TGGTGAGCGC CGGCATTTTG GGGCAATGGC CAATGGGACG TCTGGCGGAC
AAATTTGGTC GCTTGCTGGT ATTACGCGTA CAGGTATTCG TTGTCATACT CGGTAGTATT
GCCATGTTAA CCCAGGCGGC GATGGCGCCA GCTCTGTTTA TTCTGGGGGC GGCGGGTTTT
ACGCTTTATC CCGTCGCAAT GGCCTGGGCC TGTGAAAAAG TCGAACATCA CCAGCTTGTG
GCAATGAACC AGGCGCTGTT GTTAAGTTAT ACGGTAGGGA GCCTGTTGGG GCCGTCTTTT
GCTGCGATGT TAATGCAGAA TTATTCAGAT AATCTGCTGT TTATTATGAT CGCCAGCGTA
TCGTTTATTT ATCTGCTGAT GCTGTTACGT AACGCCGGCC AGACGCCTAA TCCTGTCGCC
CACATCTAA
 
Protein sequence
MSTYTRPVML LLCGLLLLTL AIAVLNTLVP LWLAQANLPT WQVGMVSSSY FTGNLVGTLF 
TGYLIKRIGF NRSYYLASLI FAAGCVGLGV MVGFWSWMSW RFIAGIGCAM IWVVVESALM
CSGTSHNRGR LLAAYMMVYY MGTFLGQLLV SKVSGELLHV LPWVTGMILA GILPLLFTRI
VNQQTQTRHS SSISAMLKLR QARLGVNGCI ISGIVLGSLY GLMPLYLKHQ GMANASIGFW
MAVLVSAGIL GQWPMGRLAD KFGRLLVLRV QVFVVILGSI AMLTQAAMAP ALFILGAAGF
TLYPVAMAWA CEKVEHHQLV AMNQALLLSY TVGSLLGPSF AAMLMQNYSD NLLFIMIASV
SFIYLLMLLR NAGQTPNPVA HI