Gene SeHA_C0420 details


Gene Information

Locus tag: SeHA_C0420
Symbol:
ID: 6490569
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 418536
End bp: 419768
Gene Length: 1233 bp
Protein Length: 410 aa
Translation table: 11
GC content: 43%
IMG OID: 642740692
Product: putative permease
Protein accession: YP_002044359
Protein GI: 194448028
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
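
The coordinates and lengths in this record are mutually consistent if the start and end positions are 1-based and inclusive and the gene length includes the stop codon. Neither convention is stated in the record, so both are assumptions here. A minimal Python sketch of that arithmetic, using hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record.
length_bp = gene_length(418536, 419768)
print(length_bp, protein_length(length_bp))
```

With the values above this reproduces the listed 1233 bp and 410 aa.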


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.682839
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.00108676
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAATA ATATTGAAGA GACAATAGGT AAGTACTTGC CTATTTTAAT GATATTACCT 
CTGGCGGGTT TGGCAGAGTT AGCTTCTTTA TATTCAATCC AGGCTTTATT ACCAAAGTTA
AGTGAGGTTT ATAATATCCC ATTGAATCAG GTAGGGATGA TTTTGTCTGC TGAGGTGGGG
TTTCTGGCAT TAGCCATGTT ATTTAGCGGC ACGTTATCTG ACCGTTTTGG GCGAAAGCCA
ATCATTTTTT ATTCGTTGCT GGCAGGAGGT ATTTTAACTC TATTGTGCGC AACTGCATCA
TCATGGCCGA TGCTGGTGGT ATACCGCGCT TTGCTCGGCA TTGCAGTAAG TGGTATTACG
GCGGCCGTTA CAGTCTATAT CAGTGAAGAA GTCTCTCCTG CTTTGGCAGG GATTGTTACT
GGATATTTTA TTTTTGGCAA TTCGCTAGGG AGTATGTCTG GTCGTGTGTT TGCAACTTTG
ATGATGGAGC ATGTATCTAT AGATACGATT TTCTTTATCT TCGGTGGTGT TCTGATTGCT
ATGGCACTCG CGGTGAAGTT GTTTCTCCCG ACATCCCGAC AGTTTGTTCC TACACCTTCA
CTACAGCTTG GTGCAGTGTT GAAAGGCGGG TTGGAACATT TTAAGAATAT CCGAGTATCA
TTATGTTTTG TCATTGGATT TATTCTTTTC GGCTCTTTTA CGTCCATCTT TAATTTTCTG
GCGTTTTACC TGCATCGACC GCCTTATGAG CTGAGCTACA CCTGGATAGG TTTGATTCCA
GTTAGCTTCT CATTAACTTT TTTTCTTGCG CCATATGCTG CCCGTGTCGC GTTGAATATT
GGGTCGATGA ATGCGCTCAG TATACTGATC ATCTGTATGA TGGTCGGTGC ATTTCTCACG
CTAATCGCCC CTTCTCTGTG GGTTTTCATT TCAGGTATCG TTTTACTGTC AGTCGCATTT
TTCTCTGCTC ATTCCACCGT ATTAGCCTGG GTCAGTTCAC GGAGTCCAAA CGCAAAAGGA
CAGGCGACAT CGTTTTATCT GCTTTGCTAC TACTCTGGCG GTGCAGTAAT GGGGTATTTA
AACGGGTATC TTTTCTCCTG GCAGGGATGG AATGCTATTG CGGCATCATG TCTGATGATG
CTGGGGATAG GATTATTTAT CTGCCGGTTT ATTTTCGCAA AATATGAGAA ACAACCGCAA
ATCAAAAAAC AGTCAGTTCA GGAGAGTTTC TGA
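
The GC content field can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over the whole gene sequence. How the record rounds the value is not stated. `gc_content` is a hypothetical helper name, and the fragment passed to it is the first 30 bases of the sequence above.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three 10-base blocks of the gene sequence, as formatted in the record.
fragment = "ATGAAAAATA ATATTGAAGA GACAATAGGT"
print(f"{gc_content(fragment):.1%}")
```

Passing the full gene sequence instead of the fragment gives the value to compare against the GC content field.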
 
Protein sequence
MKNNIEETIG KYLPILMILP LAGLAELASL YSIQALLPKL SEVYNIPLNQ VGMILSAEVG 
FLALAMLFSG TLSDRFGRKP IIFYSLLAGG ILTLLCATAS SWPMLVVYRA LLGIAVSGIT
AAVTVYISEE VSPALAGIVT GYFIFGNSLG SMSGRVFATL MMEHVSIDTI FFIFGGVLIA
MALAVKLFLP TSRQFVPTPS LQLGAVLKGG LEHFKNIRVS LCFVIGFILF GSFTSIFNFL
AFYLHRPPYE LSYTWIGLIP VSFSLTFFLA PYAARVALNI GSMNALSILI ICMMVGAFLT
LIAPSLWVFI SGIVLLSVAF FSAHSTVLAW VSSRSPNAKG QATSFYLLCY YSGGAVMGYL
NGYLFSWQGW NAIAASCLMM LGIGLFICRF IFAKYEKQPQ IKKQSVQESF
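
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in its set of permitted start codons). This gene starts with ATG, so no special start-codon handling is included. `CODON_TABLE` and `translate` are hypothetical names.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino acid string
# (shared by translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAAAAATA ATATTGAAGA GACAATAGGT"))  # MKNNIEETIG
```

The output matches the first ten residues of the protein sequence. Translating the full gene sequence the same way allows a comparison against the whole protein.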