Gene SeHA_C3232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3232 
Symbol 
ID6489866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3158441 
End bp3159859 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content53% 
IMG OID642743370 
ProductL-arabinose/proton symport protein 
Protein accessionYP_002046987 
Protein GI194449459 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.852071 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.0000142975 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTCTCTA TTAATCATGA CTCTGCTTTA ACGCCGCGTT CGCTTCGCGA CACACGACGT 
ATGAATATGT TTGTTTCGGT TTCTGCAGCG GTAGCGGGAC TGTTATTTGG TCTGGATATC
GGCGTTATCG CCGGGGCGCT GCCTTTTATT ACCGACCATT TCGTGCTGAC CAGTCGGCTG
CAGGAGTGGG TCGTCAGCAG TATGATGCTT GGCGCGGCAA TTGGCGCATT ATTTAACGGC
TGGCTTTCAT TCCGGCTGGG GCGTAAGTAT AGCCTGATGG CCGGCGCGAT TTTGTTCGTG
CTCGGCTCGC TGGGGTCGGC GTTTGCTTCC AGCGTGGAAG TATTGATTGG CGCCCGCGTG
ATACTGGGCG TAGCAGTAGG GATTGCCTCC TATACCGCGC CGCTTTATCT CTCTGAAATG
GCTAGCGAAA ATGTTCGCGG CAAAATGATC AGTATGTATC AACTGATGGT GACGTTAGGC
ATTGTGCTGG CTTTTTTATC CGATACGGCA TTTAGCTACA GCGGCAACTG GCGCGCGATG
TTGGGCGTGC TGGCGCTGCC TGCGGTGTTG CTCATTATTC TCGTGGTATT CCTGCCGAAT
AGTCCGCGTT GGCTGGCGCA AAAAGGTCGC CATATTGAAG CGGAAGAGGT GCTGCGTATG
CTGCGCGATA CCTCGGAAAA AGCCCGTGAT GAACTGAATG AAATTCGGGA AAGCCTCAAA
CTCAGGCAGG GAGGGTGGGC ATTATTTAAA GCTAACCGCA ATGTTCGCCG CGCCGTGTTC
CTCGGTATGC TGCTACAGGC AATGCAGCAG TTCACCGGCA TGAACATCAT TATGTACTAT
GCGCCGCGCA TTTTTAAAAT GGCCGGCTTT ACCACCACGG AACAGCAAAT GATCGCCACG
CTGGTGGTCG GGCTGACCTT TATGTTCGCG ACGTTTATCG CCGTCTTTAC GGTCGATAAG
GCCGGGCGTA AACCGGCGTT AAAAATCGGT TTCAGCGTAA TGGCGTTAGG GACATTGGTG
TTGGGCTACT GCCTGATGCA GTTTGATAAC GGTACGGCAT CAAGCGGTCT CTCCTGGCTT
TCCGTTGGGA TGACGATGAT GTGTATCGCC GGTTACGCGA TGAGCGCCGC TCCGGTGGTG
TGGATACTGT GTTCGGAAAT CCAGCCGCTG AAATGCCGTG ATTTTGGCAT TACCTGTTCA
ACCACGACAA ACTGGGTATC GAACATGATC ATCGGCGCGA CATTCCTGAC ACTGTTGGAC
AGCATTGGCG CGGCAGGTAC ATTCTGGCTT TACACCGCGC TGAATATCGC TTTTATCGGC
ATCACCTTCT GGCTGATTCC GGAAACCAAA AATGTCACCC TGGAGCACAT CGAACGCAAG
CTGATGGCGG GCGAGAAGCT AAGAAATATT GGCGTGTAA
 
Protein sequence
MVSINHDSAL TPRSLRDTRR MNMFVSVSAA VAGLLFGLDI GVIAGALPFI TDHFVLTSRL 
QEWVVSSMML GAAIGALFNG WLSFRLGRKY SLMAGAILFV LGSLGSAFAS SVEVLIGARV
ILGVAVGIAS YTAPLYLSEM ASENVRGKMI SMYQLMVTLG IVLAFLSDTA FSYSGNWRAM
LGVLALPAVL LIILVVFLPN SPRWLAQKGR HIEAEEVLRM LRDTSEKARD ELNEIRESLK
LRQGGWALFK ANRNVRRAVF LGMLLQAMQQ FTGMNIIMYY APRIFKMAGF TTTEQQMIAT
LVVGLTFMFA TFIAVFTVDK AGRKPALKIG FSVMALGTLV LGYCLMQFDN GTASSGLSWL
SVGMTMMCIA GYAMSAAPVV WILCSEIQPL KCRDFGITCS TTTNWVSNMI IGATFLTLLD
SIGAAGTFWL YTALNIAFIG ITFWLIPETK NVTLEHIERK LMAGEKLRNI GV