Gene SNSL254_A1726 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1726 
Symbol 
ID6486417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1698151 
End bp1699269 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content56% 
IMG OID642737106 
Productsgc region protein SgcX 
Protein accessionYP_002040858 
Protein GI194443032 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.955403 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTTTT CTGTGCAGGA AACGCTTTTT TCTTTACTGC GGCTAAACGG GATTTCAGGA 
CATGAAAGCA GTATTGCAAA CGTTATGCAG CACGCGTTTG AACAGCAGGC CAAAGACGTC
TGGCGGGATC GCTTAGGCAA TGTCGTCGCC CGTTATGGCA GCGACAAATC CGACGCGCTT
CGCCTGATGA TTTTTGCGCA TATGGATGAA GTCGGTTTTA TGGTACGCAA GATCGAACCC
TCCGGCTTTT TACGTTTTGA ACGCGTGGGC GGCCCGGCGC AAATTACTAT GCCCGGTTCG
GTCGTGACGC TTGCCGGACG TTCAGGCGAT ATCATGGGCT GTATCGGTAT TAAAGCATAT
CACTTCGCGA AGGGTGACGA GCGCACCCAG CCTCCCGCGC TCGATAAACT CTGGATTGAT
ATCGGCGCAA AAGATAAAGC GGATGCCGAA CGAATGGGTA TTCAGGTGGG GACGCCAGTA
ACCCTTTACA ACCCGCCGCA CTGTCTGGGC AACGACCTGG TATGCAGTAA GGCGCTGGAT
GACAGACTGG GGTGTACGGC GCTACTGGGC GTCGCCGAGG CTCTCGCCTC CACACCGCTC
GATATCGCGG TGTTCCTGGT CGCGTCGGTA CAGGAAGAGT TCAATATTCG CGGGATTGTT
CCCGTTTTAC GACGCGTGCG CCCCGACCTG GCGATTGGTA TTGATATCAC CCCCTCCTGC
GACACGCCTG ACCTGCAGGA TTACTCAGAT GTGCGGGTCA ACCACGGCGT CGGCATCACC
TGTCTGAACT ATCACGGACG CGGTACGTTG GCGGGACTGA TTACGCCGCC GCGTTTGCTG
CGGATGCTGG AGACCACCGC GCACGAAAAT AATATTCCCG TACAGCGAGA AGTCGCGCCA
GGCGTCATCA CCGAAACCGG CTACATTCAG GTTGAACTGG ACGGTATTCC CTGCGCCAGT
CTTTCTATTC CCTGCCGCTA TACCCACTCG CCAGCCGAAG TCGCCAGCCT GCGCGACCTG
GCTGATTGTA TCCGTTTACT GACTGCGCTG GCCAATATGT CGCCAGAACA GTTTCCCATT
GAGCCTGAAA CAGGCGCTAC ACAAGAGGCA CGACCATGA
 
Protein sequence
MTFSVQETLF SLLRLNGISG HESSIANVMQ HAFEQQAKDV WRDRLGNVVA RYGSDKSDAL 
RLMIFAHMDE VGFMVRKIEP SGFLRFERVG GPAQITMPGS VVTLAGRSGD IMGCIGIKAY
HFAKGDERTQ PPALDKLWID IGAKDKADAE RMGIQVGTPV TLYNPPHCLG NDLVCSKALD
DRLGCTALLG VAEALASTPL DIAVFLVASV QEEFNIRGIV PVLRRVRPDL AIGIDITPSC
DTPDLQDYSD VRVNHGVGIT CLNYHGRGTL AGLITPPRLL RMLETTAHEN NIPVQREVAP
GVITETGYIQ VELDGIPCAS LSIPCRYTHS PAEVASLRDL ADCIRLLTAL ANMSPEQFPI
EPETGATQEA RP