Gene SeSA_A1729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A1729 
Symbol 
ID6518227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp1672518 
End bp1673636 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content56% 
IMG OID642746833 
Productputative sgc region protein SgcX 
Protein accessionYP_002114636 
Protein GI194734384 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTTTT CTGTGCAGGA AACGCTTTTT TCTTTACTGC GGCTAAACGG GATTTCAGGA 
CATGAAAGCA GTATTGCAAA CGTTATGCAG CACGCGTTTG AACAGCAGGC CAAAGACGTC
TGGCGGGATC GCCTGGGCAA TGTCGTCGCC CGTTATGGCA GCGATAAATC CGACGCGCTT
CGCCTGATGA TTTTTGCGCA TATGGATGAA GTGGGTTTTA TGGTACGCAA GATCGAACCC
TCCGGTTTTT TACGTTTTGA ACGCGTGGGC GGCCCGGCGC AAATTACTAT GCCCGGTTCG
ATTGTGACGC TTGCCGGACG TTCAGGCGAT ATCATGGGCT GTATCGGTAT TAAAGCATAT
CACTTCGCGA AGGGTGACGA GCGCACCCAG CCTCCCGCTC TCGATAAACT CTGGATTGAT
ATCGGCGCAA AAGATAAAGC GGATGCCGAA CGAATGGGTA TTCAGGTGGG GACGCCAGTA
ACCCTTTACA ACCCGCCGCA CTGTCTGGGC AACGACCTGG TATGCAGTAA GGCGCTGGAT
GACAGGCTGG GGTGTACGGC GCTACTGGGC GTCGCCGAGG CCCTCGCCTC CACACCGCTC
GATATCGCGG TATTCCTGGT CGCGTCGGTG CAGGAAGAGT TCAATATTCG CGGCATTATT
CCCGTTTTAC GACGCGTGCG CCCCGACCTG GCGATTGGTA TTGATATCAC CCCATCCTGC
GACACGCCTG ACCTGCAGGA TTACTCGGAT GTGCGGGTCA ACCACGGCGT CGGCATCACC
TGTCTGAACT ATCACGGACG CGGTACGTTG GCGGGACTGA TTACGCCGCC GCGTTTGCTG
CGGATGCTGG AGACCACCGC GCACGAAAAT AATATTCCCG TACAGCGAGA AGTCGCGCCA
GGCGTCATCA CCGAAACCGG CTACATTCAG GTTGAACTGG ACGGTATTCC CTGCGCCAGC
CTTTCTATTC CTTGCCGCTA TACCCACTCG CCAGCCGAAG TCGCCAGCCT GCGCGACCTG
ACTGATTGTA TCCGTTTACT GACTGCGCTG GCCAATATGT CGCCAGAACA GTTTCCCATT
GAGCCTGAAA CAGGCGCTAC ACAAGAGGCA CGACCATGA
 
Protein sequence
MTFSVQETLF SLLRLNGISG HESSIANVMQ HAFEQQAKDV WRDRLGNVVA RYGSDKSDAL 
RLMIFAHMDE VGFMVRKIEP SGFLRFERVG GPAQITMPGS IVTLAGRSGD IMGCIGIKAY
HFAKGDERTQ PPALDKLWID IGAKDKADAE RMGIQVGTPV TLYNPPHCLG NDLVCSKALD
DRLGCTALLG VAEALASTPL DIAVFLVASV QEEFNIRGII PVLRRVRPDL AIGIDITPSC
DTPDLQDYSD VRVNHGVGIT CLNYHGRGTL AGLITPPRLL RMLETTAHEN NIPVQREVAP
GVITETGYIQ VELDGIPCAS LSIPCRYTHS PAEVASLRDL TDCIRLLTAL ANMSPEQFPI
EPETGATQEA RP