Gene SeHA_C3778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3778 
SymboltsgA 
ID6488339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3636365 
End bp3637546 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content55% 
IMG OID642743890 
Producthypothetical protein 
Protein accessionYP_002047496 
Protein GI194447901 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0738] Fucose permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value0.974681 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAACA GCAACCGCAT CAAGCTCACA TGGATCAGCT TTCTTTCCTA CGCCCTGACC 
GGGGCGCTGG TGATTGTCAC CGGGATGGTG ATGGGAAATA TCGCAGACTA TTTTCATCTG
CCCGTTTCCA GCATGAGTAA CACCTTTACT TTCCTGAATG CCGGGATTTT GATCTCGATC
TTCCTCAATG CGTGGCTGAT GGAAATCGTC CCGCTGAAAA CACAGCTACG CTTTGGTTTT
ATCCTGATGG TGCTGGCGGT GGCCGGGCTG ATGTTCAGCC ATAGCCTGGC GTTGTTCTCA
GCGGCGATGT TTGTGCTGGG GCTGGTCAGC GGGATCACCA TGTCGATTGG CACCTTCCTG
ATTACGCAAC TGTATGAAGG GCGTCAGCGC GGTTCCCGAC TGCTGTTTAC CGACTCCTTC
TTCAGCATGG CGGGAATGAT TTTTCCTATG GTCGCCGCCT TCCTGCTGGC GCGTAGTATT
GAGTGGTACT GGGTCTACGC CTGCATCGGC CTGGTCTACC TGGCGATTTT CATCCTGACC
TTCGGCTGTG AATTTCCGGC GCTGGGTAAA CATGCGCAGC ACTCTCAGGC GCCTGTCGTC
AAAGAAAAAT GGGGCATTGG CGTACTGTTT CTCGCCGTCG CCGCGCTGTG CTATATCCTC
GGTCAATTAG GCTTTATCTC CTGGGTGCCG GAATACGCCA AAGGCCTCGG CATGAGCCTG
AATGACGCCG GGGCGCTGGT GAGTGATTTC TGGATGTCCT ATATGTTTGG CATGTGGGCG
TTCAGCTTTA TCCTGCGCTT TTTCGATCTG CAACGCATTC TGACCGTACT GGCGGGTATG
GCGGCGGTAC TGATGTATCT GTTTATTACC GGCACGCAGG CGCATATGCC GTGGTTTATT
CTGACGCTGG GCTTCTTCTC CAGCGCCATT TATACCTCCA TCATTACGCT GGGATCGCAG
CAAACGAAAG TGGCCTCGCC TAAGCTGGTT AACTTTATTC TGACCTGCGG CACTATCGGA
ACGATGCTGA CCTTCGTCGT CACCGGCCCG ATTGTGGCGC ACAGCGGCCC ACAGGCGGCG
TTACTCACCG CGAATGGTCT GTATGCGGTG GTCTTTGTGA TGTGCTTTGC GCTCGGCTTT
GTCTCCCGTC ATCGTCAGCA TAGCGCGCCG GCTACGCATT GA
 
Protein sequence
MTNSNRIKLT WISFLSYALT GALVIVTGMV MGNIADYFHL PVSSMSNTFT FLNAGILISI 
FLNAWLMEIV PLKTQLRFGF ILMVLAVAGL MFSHSLALFS AAMFVLGLVS GITMSIGTFL
ITQLYEGRQR GSRLLFTDSF FSMAGMIFPM VAAFLLARSI EWYWVYACIG LVYLAIFILT
FGCEFPALGK HAQHSQAPVV KEKWGIGVLF LAVAALCYIL GQLGFISWVP EYAKGLGMSL
NDAGALVSDF WMSYMFGMWA FSFILRFFDL QRILTVLAGM AAVLMYLFIT GTQAHMPWFI
LTLGFFSSAI YTSIITLGSQ QTKVASPKLV NFILTCGTIG TMLTFVVTGP IVAHSGPQAA
LLTANGLYAV VFVMCFALGF VSRHRQHSAP ATH