Gene SeHA_C4076 details


Gene Information

Locus tag: SeHA_C4076
Symbol:
ID: 6488558
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3959916
End bp: 3961298
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 55%
IMG OID: 642744176
Product: putative transporter
Protein accession: YP_002047781
Protein GI: 194451859
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID: [TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter
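
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (the record does not state this) and that the final codon is a stop codon; the variable names are illustrative only:

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive coordinates on the replicon.
start_bp = 3959916
end_bp = 3961298

gene_length = end_bp - start_bp + 1   # inclusive span in base pairs
codons = gene_length // 3             # total codons, including the terminal stop
protein_length = codons - 1           # the stop codon contributes no residue

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the results agree with the Gene Length and Protein Length fields of the record.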


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.591118
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 74
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGTG AAATTCTCTC CGTTAAGGAG AAGATTGGCT ATGGTATGGG CGATGCCGCC 
AGCCACATCA TCTTTGATAA CGTCATGTTA TATATGATGT TCTTCTATAC CGATATTTTC
GGCATTCCCG CTGGTTTTGT TGGCACCATG TTTTTACTGG CGCGTGCGCT TGATGCCATC
TCCGACCCTT GTATGGGCCT GCTGGCCGAC CGCACCCGCT CTCGCTGGGG CAAATTCCGA
CCCTGGGTGC TGTTTGGCGC GTTGCCGTTT GGTATCGTTT GTGTGCTGGC TTATAGCACG
CCGGATCTCA GTCTGAACGG CAAAATGATT TATGCCGCCA TCACCTACAC GTTGCTCACC
CTTCTGTACA CTGTGGTCAA CATCCCTTAC TGCGCGTTGG GGGGTGTAAT AACCAATGAC
CCAACGCAGC GTATCTCCCT GCAATCCTGG CGCTTTGTGC TGGCAACGGC GGGCGGAATG
CTCTCTACCG TACTGATGAT GCCTCTGGTG AAACTGATTG GCGGCGAGAA TAAGGCGCTG
GGCTTCCAGG GGGGTATCGC GGCGCTCTCG GTGGTGGCGT TCCTGATGCT GGCGTTCTGC
TTCTTTACCA CCAAAGAGCG CGTTGAAGCG CCTGCCACCC ATACCTCCAT GCGTGAAGAC
CTGCGTGATA TCTGGCACAA CGACCAGTGG CGCATAGTCG GCCTGCTCAC CATCCTGAAT
ATTCTGGCGG TATGCGTGCG CGGCGGGGCG ATGATGTATT ACGTCACCTG GATATTGGGC
AAACCGGGCG TGTTTGTCGC CTTCCTCACC ACCTATTGTG TCGGCAACCT GATTGGCTCG
GCGCTGGCAA AACCGTTGAC CGACTGGAAA TGCAAAGTGA GCGTTTTCTG GTGGACCAAC
GCCTTACTCG CAGTAATCAG CGTGGCGATG TTCTTCGTAC CGATGCACGC CACGATCGCT
ATGTTCGTCT TTATCTTTGT GATTGGCGTA TTGCACCAGT TAGTCACGCC TATCCAGTGG
GTGATGATGT CTGACACCGT CGACTATGGC GAATGGTGTA ACGGCAAACG CCTGACGGGG
ATCAGTTTTG CCGGCACGTT GTTCGTGCTG AAACTGGGTC TTGCCCTCGG CGGGGCGCTG
ATTGGCTGGA TGCTGGCAGG CGGCGGTTAC GACGCGGCGG CGAAAACGCA AAACAGCGCC
ACGATCAGCA TCATCATCGC TCTGTTCACT ATCGTCCCGG CCATCTGTTA TCTGCTGAGC
GCCGCGATCG CTAAACGCTA CTACACCCTG AAAAGCCCGT TCCTGAAAAC CATTCTGGAG
CAACTGGCGC AGGGCGCACA CCGCAACGAA CAAGAATTTA CCCATAAAGA ATTGCAAAAC
TAA
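
The gene sequence above is printed in blocks of ten bases separated by spaces, so the whitespace has to be removed before any analysis. A minimal sketch of computing GC content from such a listing; the function name `gc_percent` is hypothetical, and only the first line of the sequence is used here, so the value will differ from the whole-gene figure reported in the record:

```python
def gc_percent(spaced_sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(spaced_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)

# First line of the gene sequence listing (60 bases).
first_line = "ATGAAAAGTG AAATTCTCTC CGTTAAGGAG AAGATTGGCT ATGGTATGGG CGATGCCGCC"
print(round(gc_percent(first_line), 1))
```

Passing the full listing, with all lines joined, would give the figure for the whole gene.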
 
Protein sequence
MKSEILSVKE KIGYGMGDAA SHIIFDNVML YMMFFYTDIF GIPAGFVGTM FLLARALDAI 
SDPCMGLLAD RTRSRWGKFR PWVLFGALPF GIVCVLAYST PDLSLNGKMI YAAITYTLLT
LLYTVVNIPY CALGGVITND PTQRISLQSW RFVLATAGGM LSTVLMMPLV KLIGGENKAL
GFQGGIAALS VVAFLMLAFC FFTTKERVEA PATHTSMRED LRDIWHNDQW RIVGLLTILN
ILAVCVRGGA MMYYVTWILG KPGVFVAFLT TYCVGNLIGS ALAKPLTDWK CKVSVFWWTN
ALLAVISVAM FFVPMHATIA MFVFIFVIGV LHQLVTPIQW VMMSDTVDYG EWCNGKRLTG
ISFAGTLFVL KLGLALGGAL IGWMLAGGGY DAAAKTQNSA TISIIIALFT IVPAICYLLS
AAIAKRYYTL KSPFLKTILE QLAQGAHRNE QEFTHKELQN
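
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch of that translation, assuming the amino acid assignments of table 11 match the standard code (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and ambiguous bases such as N are not handled:

```python
from itertools import product

# NCBI compact layout: codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA listing codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)

# First three ten-base blocks of the gene sequence listing.
first_blocks = "ATGAAAAGTG AAATTCTCTC CGTTAAGGAG"
print(translate(first_blocks))
```

The ten residues produced from these thirty bases can be compared with the first block of the protein sequence listing.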