Gene SNSL254_A3743 details


Gene Information

Locus tag: SNSL254_A3743
Symbol: tsgA
ID: 6483677
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3604089
End bp: 3605270
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 55%
IMG OID: 642739012
Product: hypothetical protein
Protein accession: YP_002042723
Protein GI: 194446244
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0738] Fucose permease
TIGRFAM ID:
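
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 3604089
end_bp = 3605270

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1182 0 393
```

Under these assumptions the result agrees with the listed Gene Length (1182 bp) and Protein Length (393 aa).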


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.0396479
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTAACA GCAACCGCAT CAAGCTCACA TGGATCAGCT TTCTTTCCTA CGCCCTGACC 
GGGGCGCTGG TGATTGTCAC CGGGATGGTG ATGGGAAATA TCGCAGACTA TTTTCAGCTG
CCCGTTTCCA GCATGAGTAA CACCTTTACT TTCCTGAATG CCGGGATTTT GATCTCGATC
TTCCTCAATG CGTGGCTGAT GGAAATCATC CCGCTGAAAA CACAGCTACG CTTTGGTTTT
ATCCTGATGG TGCTGGCGGT GGCCGGGCTG ATGTTCAGCC ATAGCCTGGC GTTGTTCTCA
GCGGCGATGT TTGTGCTGGG GCTGGTCAGC GGGATCACCA TGTCGATTGG CACCTTCCTG
ATTACGCAAC TGTATGAAGG GCGTCAGCGC GGTTCCCGAC TGCTGTTTAC CGACTCCTTC
TTCAGCATGG CGGGGATGAT TTTTCCTATG GTCGCCGCCT TCCTGCTGGC GCGTAGTATT
GAGTGGTACT GGGTCTACGC CTGCATCGGC CTGGTCTACC TGGCGATTTT CATCCTGACC
TTCGGCTGTG AATTTCCGGC GCTGGGTAAA CATGCGCAGC ACTCTCAGGC ACCTGTCGTC
AAAGAAAAAT GGGGCATTGG CGTACTGTTT CTCGCCGTCG CCGCGCTGTG CTATATCCTC
GGTCAATTGG GCTTTATCTC CTGGGTGCCG GAATACGCCA AAGGCCTCGG CATGAGCCTG
AATGACGCCG GGGCGCTGGT GAGTGATTTC TGGATGTCCT ATATGTTTGG CATGTGGGCG
TTCAGCTTTA TCCTGCGCTT TTTCGATCTG CAACGCATTC TGACCGTACT GGCGGGTATG
GCGGCGGTAC TGATGTATTT GTTTATTACC GGCACGCAGG CGCATATGCC GTGGTTTATT
CTGACGCTGG GCTTCTTCTC CAGCGCCATT TATACCTCCA TCATTACGCT GGGATCGCAG
CAAACGAAAG TGGCCTCGCC TAAGCTGGTT AACTTTATTC TGACCTGCGG CACTATCGGA
ACGATGCTGA CCTTCGTCGT CACCGGCCCG ATTGTGGCGC ACAGCGGCCC ACAGGCGGCG
TTACTCACCG CGAATGGTCT GTATGCGGTG GTCTTTGTGA TGTGCTTTGC GCTCGGTTTT
GTATCCCGTC ATCGTCAGCA TAGCGCGCCG GCTACGCATT GA
 
Protein sequence
MTNSNRIKLT WISFLSYALT GALVIVTGMV MGNIADYFQL PVSSMSNTFT FLNAGILISI 
FLNAWLMEII PLKTQLRFGF ILMVLAVAGL MFSHSLALFS AAMFVLGLVS GITMSIGTFL
ITQLYEGRQR GSRLLFTDSF FSMAGMIFPM VAAFLLARSI EWYWVYACIG LVYLAIFILT
FGCEFPALGK HAQHSQAPVV KEKWGIGVLF LAVAALCYIL GQLGFISWVP EYAKGLGMSL
NDAGALVSDF WMSYMFGMWA FSFILRFFDL QRILTVLAGM AAVLMYLFIT GTQAHMPWFI
LTLGFFSSAI YTSIITLGSQ QTKVASPKLV NFILTCGTIG TMLTFVVTGP IVAHSGPQAA
LLTANGLYAV VFVMCFALGF VSRHRQHSAP ATH
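
The protein sequence corresponds to a translation of the gene sequence. The sketch below, which is not part of the original record, shows one way to check this on the first and last lines of the gene sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons it permits, so the standard codon assignments are used here. The helper names (`clean`, `translate`, `gc_content`) are hypothetical, and the sequence is assumed to be in frame from its first base.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGACTAACA GCAACCGCAT CAAGCTCACA TGGATCAGCT TTCTTTCCTA CGCCCTGACC"
last_line = "GTATCCCGTC ATCGTCAGCA TAGCGCGCCG GCTACGCATT GA"

print(translate(first_line))  # MTNSNRIKLTWISFLSYALT
print(translate(last_line))   # VSRHRQHSAPATH
```

Both outputs match the start and end of the listed protein sequence. Passing the complete gene sequence to `gc_content` would give a value that can be compared with the GC content field.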