Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B1549 |
Symbol | |
ID | 6794890 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | - |
Start bp | 1508958 |
End bp | 1510076 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642775788 |
Product | putative sgc region protein SgcX |
Protein accession | YP_002146424 |
Protein GI | 197247688 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 38 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTTTT CTGTGCAGGA AACGCTTTTT TCTTTACTGC GGCTAAACGG GATTTCAGGA CATGAAAGCA GTATTGCAAA CGTTATGCAG CACGCGTTTG AACAGCAGGC CAAAGACGTC TGGCGGGATC GCTTAGGCAA TGTCGTCGCC CGTTATGGCA GCGATAAACC CGACGCGCTT CGCCTGATGA TTTTTGCGCA TATGGATGAA GTGGGTTTTA TGGTACGCAA GATCGAACCC TCCGGTTTTT TACGTTTTGA ACGCGTGGGC GGCCCGGCGC AAATTACTAT GCCCGGTTCG ATTGTGACGC TTGCCGGACG TTCAGGCGAT ATCATGGGCT GTATCGGTAT TAAAGCATAT CACTTCGCGA AGGGTGACGA GCGCACCCAG CCACCCGCTC TCGATAAACT CTGGATTGAT ATCGGCGCAA AAGATAAAGC GGATGCCGAA CGAATGGGTA TTCAGGTGGG GACGCCAGTA ACCCTTTACA ACCCGCCGCA CTGTCTGGGC AACGACCTGG TGTGCAGTAA GGCGCTGGAT GACAGGCTGG GGTGTACGGC GCTACTGGGC GTCGCCGAGG CTATCGCCTC CACACCGCTT GATATCGCGG TATTCCTGGT CGCGTCGGTA CAGGAAGAGT TCAATATTCG CGGTATTATT CCCGTTTTAC GACGCGTGCG CCCCGACCTG GCGATTGGTA TTGATATCAC CCCCTCCTGC GACACGCCTG ACCTGCAGGA TTACTCGGAT GTGCGGGTCA ACCACGGCGT CGGCATCACC TGTCTGAACT ATCACGGACG CGGTACGTTG GCGGGACTGA TTACGCCGCC GCGTTTGCTG CGGATGCTGG AGACCACCGC GCACGAAAAT AATATTCCCG TACAGCGAGA AGTCGCGCCA GGCGTCATCA CCGAAACCGG CTACATTCAG GTTGAACTGG ACGGTATTCC CTGCGCCAGT CTTTCTATTC CCTGCCGCTA TACCCACTCG CCAGCCGAAG TCGCCAGCCT GCGCGACCTG GCTGATTGTA TCCGTTTACT GACTGCGCTG GCCAATATGT CGCCAGAGCA GTTTCCCATT GAGCCTGAAA CAGGCGCTAC ACAAGAGGCA CGACCATGA
|
Protein sequence | MTFSVQETLF SLLRLNGISG HESSIANVMQ HAFEQQAKDV WRDRLGNVVA RYGSDKPDAL RLMIFAHMDE VGFMVRKIEP SGFLRFERVG GPAQITMPGS IVTLAGRSGD IMGCIGIKAY HFAKGDERTQ PPALDKLWID IGAKDKADAE RMGIQVGTPV TLYNPPHCLG NDLVCSKALD DRLGCTALLG VAEAIASTPL DIAVFLVASV QEEFNIRGII PVLRRVRPDL AIGIDITPSC DTPDLQDYSD VRVNHGVGIT CLNYHGRGTL AGLITPPRLL RMLETTAHEN NIPVQREVAP GVITETGYIQ VELDGIPCAS LSIPCRYTHS PAEVASLRDL ADCIRLLTAL ANMSPEQFPI EPETGATQEA RP
|
| |