Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C1790 |
Symbol | |
ID | 6491471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 1749883 |
End bp | 1751001 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642742006 |
Product | sgc region protein SgcX |
Protein accession | YP_002045651 |
Protein GI | 194448087 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00171255 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 85 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTTTT CTGTGCAGGA AACGCTTTTT TCTTTACTGC GGCTAAACGG GATTTCAGGA CATGAAAGCA GTATTGCAAA CGTTATGCAG CACGCGTTTG AACAGCAGGC CAAAGACGTC TGGCGGGATC GCTTAGGCAA TGTCGTCGCC CGTTATGGCA GCGATAAACC CGACGCGCTT CGCCTGATGA TTTTTGCGCA TATGGATGAA GTGGGTTTTA TGGTACGCAA GATCGAACCC TCCGGTTTTT TACGTTTTGA ACGCGTGGGC GGCCCGGCGC AAATTACTAT GCCCGGTTCG ATTGTGACGC TTGCCGGACG TTCAGGCGAT ATCATGGGCT GTATCGGTAT TAAAGCATAT CACTTCGCGA AGGGTGACGA GCGCACCCAG CCACCCGCTC TCGATAAACT CTGGATTGAT ATCGGCGCAA AAGATAAAGC GGATGCCGAA CGAATGGGTA TTCAGGTGGG GACGCCAGTA ACCCTTTACA ACCCGCCGCA CTGTCTGGGC AACGACCTGG TATGCAGTAA GGCGCTGGAT GACAGACTGG GGTGTACGGC GCTACTGGGC GTCGCCGAGG CTCTCGCCTC CACACCGCTC GATATCGCGG TGTTCCTGGT CGCGTCGGTG CAGGAAGAGT TCAATATTCG CGGGATTGTT CCCGTTTTAC GACGCGTGCG CCCCGACCTG GCGATTGGTA TTGATATCAC CCCCTCCTGC GACACGCCTG ACCTGCAGGA TTACTCGGAT GTGCGGGTCA ACCACGGCGT CGGCATCACC TGTCTGAACT ATCACGGACG CGGTACGTTG GCGGGACTGA TTACGCCGCC GCGTTTGCTG CGGATGCTGG AGACCACCGC GCACGAAAAT AATATTCCCG TACAGCGAGA AGTCGCGCCA GGCGTCATCA CCGAAACCGG CTACATTCAG GTTGAACTGG ACGGTATTCC CTGCGCCAGT CTTTCTATTC CCTGCCGCTA TACCCACTCG CCAGCCGAAG TCGCCAGCCT GCGCGACCTG GCTGATTGTA TCCGTTTACT GACTGCGCTG GCCAATATGT CGCCAGAACA GTTTCCCATT GAGCCTGAAA CAGGCGCTAC ACAAGAGGCA CGACCATGA
|
Protein sequence | MTFSVQETLF SLLRLNGISG HESSIANVMQ HAFEQQAKDV WRDRLGNVVA RYGSDKPDAL RLMIFAHMDE VGFMVRKIEP SGFLRFERVG GPAQITMPGS IVTLAGRSGD IMGCIGIKAY HFAKGDERTQ PPALDKLWID IGAKDKADAE RMGIQVGTPV TLYNPPHCLG NDLVCSKALD DRLGCTALLG VAEALASTPL DIAVFLVASV QEEFNIRGIV PVLRRVRPDL AIGIDITPSC DTPDLQDYSD VRVNHGVGIT CLNYHGRGTL AGLITPPRLL RMLETTAHEN NIPVQREVAP GVITETGYIQ VELDGIPCAS LSIPCRYTHS PAEVASLRDL ADCIRLLTAL ANMSPEQFPI EPETGATQEA RP
|
| |