Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3895 |
Symbol | |
ID | 6483353 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 3779283 |
End bp | 3780788 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 642739159 |
Product | putative cellulose biosynthesis protein BcsE |
Protein accession | YP_002042870 |
Protein GI | 194442900 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03369] cellulose biosynthesis protein BcsE |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 67 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAACCG GCGGCGTCTG GTGGGTTAAC GCCGATCGCC AGCAAGATGC CATCAGTCTG GTGAATCAAA CGATTGCGTC ACAAACGGAG AATGCAAATG TCGCCGTCAT CGGCATGGAA GGCGATCCTG GCAAAGTAAT CAAATTAGAT GAATCTCACG GTCCGGAGAA AATCCGCTTA TTTACCATGC CGGATTCAGA AAAAGGGCTA TACTCTTTGC CCCACGATTT GCTTTGTTCT GTTAACCCGA CGCATTACTT TTTCATTCTT ATTTGTGCAA ATAACACGTG GCGGAATATA ACGTCAGAAA GCCTGCATAA ATGGCTGGAA AAAATGAATA AATGGACTCG TTTTCATCAC TGTTCATTGT TGGTTATTAA CCCTTGTAAT AATAGCGATA AACAGTCCTC GTTGTTGATG GGCGAGTATC GCTCACTTTT CGGCCTCGCC AGTTTACGTT TTCAGGGCGA CCAACATTTG TTCGATATTG CCTTCTGGTG TAACGAAAAA GGCGTCAGCG CCCGACAGCA GTTATTGCTG TGTCAGCAGG ACGAACGCTG GACGCTATCC CATCAGGAGG AGACGGCAAT TCAGCCGCGT AGCGACGAAA AACGCATTCT TAGCCACGTC GCCGTCCTTG AAGGCGCGCC GCCGCTCTCG GAACACTGGA CGCTTTTCGA CAATAACGAA GCGCTATTCA ACGACGCGCG CACGGCGCAG GCCGCGACAA TTATTTTTTC GCTTACACAG AACAACCAAA TCGAGCCGCT TGCTCGTCGC ATTCATACTT TGCGGCGCCA GCGGGGAAGC GCGCTGAAAA TTGTCGTGCG CGAAAATATC GCCAGTTTGC GCGCCACCGA TGAGCGCCTG CTGCTGGGCT GCGGCGCGAA TATGATCATT CCCTGGAACG CCCCGCTTTC ACGCTGCCTG ACGCTTATTG AAAGCGTGCA GGGGCAGCAG TTCAGCCGTT ACGTACCGGA AGACATCACC ACGCTACTGT CGATGACGCA GCCGTTGAAA CTGCGCGGTT TTCAGCCGTG GGATACCTTC TGCGATGCCA TCCATACGAT GATGAGCAAC ACCCTGCTCC CCGCCGACGG GAAAGGCGTT CTGGTCGCGC TGCGCCCGGT GCCGGGCATT CGGGTTGAGC AGGCATTAAC ATTATGTCGG CCAAACCGAA CCGGCGATAT TATGACCATC GGCGGCAACC GTCTGGTGCT GTTTTTATCA TTCTGCCGGG TCAACGATCT GGATACCGCG TTAAACCATA TTTTCCCTTT GCCGACGGGC GATATTTTCT CTAATCGTAT GGTCTGGTTC GAAGATAAAC AAATCAGCGC CGAGCTGGTG CAGATGCGCT TATTGTCGCC GGAACTGTGG GGAACGCCGC TACCGCTGGC AAAACGCGCC GACCCGGTAA TAAATGCCGA ACACGATGGC CGCATCTGGC GTCGTATTCC TGAACCCCTG CGACTGCTCG ACGACACCGC GGAGCGTGCA TCATGA
|
Protein sequence | MPTGGVWWVN ADRQQDAISL VNQTIASQTE NANVAVIGME GDPGKVIKLD ESHGPEKIRL FTMPDSEKGL YSLPHDLLCS VNPTHYFFIL ICANNTWRNI TSESLHKWLE KMNKWTRFHH CSLLVINPCN NSDKQSSLLM GEYRSLFGLA SLRFQGDQHL FDIAFWCNEK GVSARQQLLL CQQDERWTLS HQEETAIQPR SDEKRILSHV AVLEGAPPLS EHWTLFDNNE ALFNDARTAQ AATIIFSLTQ NNQIEPLARR IHTLRRQRGS ALKIVVRENI ASLRATDERL LLGCGANMII PWNAPLSRCL TLIESVQGQQ FSRYVPEDIT TLLSMTQPLK LRGFQPWDTF CDAIHTMMSN TLLPADGKGV LVALRPVPGI RVEQALTLCR PNRTGDIMTI GGNRLVLFLS FCRVNDLDTA LNHIFPLPTG DIFSNRMVWF EDKQISAELV QMRLLSPELW GTPLPLAKRA DPVINAEHDG RIWRRIPEPL RLLDDTAERA S
|
| |