Gene SeHA_C2995 details

Gene Information

Locus tag: SeHA_C2995
Symbol:
ID: 6489162
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 2933951
End bp: 2935135
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 59%
IMG OID: 642743151
Product: major facilitator family transporter
Protein accession: YP_002046775
Protein GI: 194448840
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
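
The length fields above are consistent with the coordinates, which can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2933951
end_bp = 2935135

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp;", protein_length, "aa")
```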


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.00244716
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACGAAAC CCACTCATGG GCTTAGCCCG GCGCTGATCG TTTTAATGTC TGTGGCCACG 
GGTCTGGCGG TCGCCAGCAA CTACTACGCC CAGCCGCTGC TTGATACCAT CGCGCATCAC
TTTTCGCTTT CCGCCAGCTC CGCAGGGTTT ATCGTTACCG CCGCGCAGTT GGGCTATGCC
GCTGGCCTGT TGTTTCTGGT GCCGCTCGGC GACATGTTTG AACGCCGAAC GCTGATTGTC
TCCATGACGT TGCTGGCGGC TGGCGGAATG CTGATCACCG CCAGCAGTCA GTCGCTTAGC
ATGATGATAC TCGGAACGGC CTTAACCGGA CTGTTCTCCG TGGTGGCGCA GATTCTGGTT
CCGCTGGCCG CCACACTTGC GACGCCCGCC ACCCGCGGTA AAGTGGTCGG CACCATTATG
AGCGGCCTGT TGCTGGGGAT CCTGCTGGCG CGAACGGTCG CCGGACTGCT GGCAAACCTC
GGCGGTTGGC GCACCGTATT TTGGGTAGCG TCGGCGCTGA TGGCGCTGAT GGCCGTCGCG
TTATGGCGCG GACTGCCAAA GCTCAAATCC GACACCCATC TTAACTACCC GCAACTGTTG
GGTTCTGTAT TCAGCCTGTT TATTCACGAT AAGCTGCTGC GTACCCGCGC ACTGCTGGGC
TGTCTGACCT TTGCTAATTT CAGCATCCTC TGGACATCAA TGGCCTTTTT GCTCGCCGCG
CCGCCGTTTA GCTACTCCGA GGGGATGATT GGCCTGTTTG GCCTGGCGGG GGCCGCCGGC
GCTTTAGGCG CGCGTCCGGC TGGCGGATTT GCCGATAAAG GTAAATCTCA CCTCACCACC
ACGTTCGGCT TACTGCTGCT GTTACTTTCC TGGCTGGCTA TCTGGCTTGG GCACACCTCG
GTACTGGCGC TGATTATTGG CATTCTGGTA CTGGACCTCA CCGTTCAGGG GGTACATATC
ACCAATCAGA CGGTCATCTA TCGTTTGCAT CCGGATGCGC GTAACCGGCT CACCGCCGGC
TATATGACCA GCTACTTTAT CGGTGGCGCC GCGGGGTCGC TGATTTCCGC CTCCGCCTGG
CAACATGCCG GCTGGGCCGG CGTTTGTCTG GCGGGTGTCA CGGTAGCCTT ACTTAATTTA
CTGGTCTGGT GGCGAGGTTT TCACCGACAG GAAGCCGTAA ATTAA
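
The gene sequence is printed in blocks of 10 bases, so it needs to be joined before any calculation such as GC content. The following is a minimal sketch of that step; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first two blocks of the sequence are used as sample input. The page does not state exactly how its GC content figure was computed or rounded.

```python
def clean_sequence(blocked: str) -> str:
    """Join a blocked sequence by removing all whitespace and uppercasing."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a non-empty sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, used here as sample input.
sample = clean_sequence("ATGACGAAAC CCACTCATGG")
print(sample, gc_percent(sample))
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field in the Gene Information table.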
 
Protein sequence
MTKPTHGLSP ALIVLMSVAT GLAVASNYYA QPLLDTIAHH FSLSASSAGF IVTAAQLGYA 
AGLLFLVPLG DMFERRTLIV SMTLLAAGGM LITASSQSLS MMILGTALTG LFSVVAQILV
PLAATLATPA TRGKVVGTIM SGLLLGILLA RTVAGLLANL GGWRTVFWVA SALMALMAVA
LWRGLPKLKS DTHLNYPQLL GSVFSLFIHD KLLRTRALLG CLTFANFSIL WTSMAFLLAA
PPFSYSEGMI GLFGLAGAAG ALGARPAGGF ADKGKSHLTT TFGLLLLLLS WLAIWLGHTS
VLALIIGILV LDLTVQGVHI TNQTVIYRLH PDARNRLTAG YMTSYFIGGA AGSLISASAW
QHAGWAGVCL AGVTVALLNL LVWWRGFHRQ EAVN
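
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch, not the database's own pipeline: `translate` is a hypothetical helper, and the codon table is built from the standard amino acid assignments, which translation table 11 shares (table 11 differs in the start codons it permits, which this sketch does not model).

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments are the same
# for the standard code and for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence and its last four codons.
print(translate("ATGACGAAAC CCACTCATGG GCTTAGCCCG"))
print(translate("GCCGTAAATTAA"))
```

The two sample calls correspond to the first ten and the last three residues of the protein sequence listed above.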