Gene SNSL254_A3895 details

Gene Information

Locus tag: SNSL254_A3895
Symbol:
ID: 6483353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3779283
End bp: 3780788
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 53%
IMG OID: 642739159
Product: putative cellulose biosynthesis protein BcsE
Protein accession: YP_002042870
Protein GI: 194442900
COG category:
COG ID:
TIGRFAM ID: [TIGR03369] cellulose biosynthesis protein BcsE
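
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends with a single stop codon. The function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon (assumed present) does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length(3779283, 3780788)
print(length)                  # 1506
print(protein_length(length))  # 501
```

Both results agree with the Gene Length (1506 bp) and Protein Length (501 aa) reported in the record.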


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAACCG GCGGCGTCTG GTGGGTTAAC GCCGATCGCC AGCAAGATGC CATCAGTCTG 
GTGAATCAAA CGATTGCGTC ACAAACGGAG AATGCAAATG TCGCCGTCAT CGGCATGGAA
GGCGATCCTG GCAAAGTAAT CAAATTAGAT GAATCTCACG GTCCGGAGAA AATCCGCTTA
TTTACCATGC CGGATTCAGA AAAAGGGCTA TACTCTTTGC CCCACGATTT GCTTTGTTCT
GTTAACCCGA CGCATTACTT TTTCATTCTT ATTTGTGCAA ATAACACGTG GCGGAATATA
ACGTCAGAAA GCCTGCATAA ATGGCTGGAA AAAATGAATA AATGGACTCG TTTTCATCAC
TGTTCATTGT TGGTTATTAA CCCTTGTAAT AATAGCGATA AACAGTCCTC GTTGTTGATG
GGCGAGTATC GCTCACTTTT CGGCCTCGCC AGTTTACGTT TTCAGGGCGA CCAACATTTG
TTCGATATTG CCTTCTGGTG TAACGAAAAA GGCGTCAGCG CCCGACAGCA GTTATTGCTG
TGTCAGCAGG ACGAACGCTG GACGCTATCC CATCAGGAGG AGACGGCAAT TCAGCCGCGT
AGCGACGAAA AACGCATTCT TAGCCACGTC GCCGTCCTTG AAGGCGCGCC GCCGCTCTCG
GAACACTGGA CGCTTTTCGA CAATAACGAA GCGCTATTCA ACGACGCGCG CACGGCGCAG
GCCGCGACAA TTATTTTTTC GCTTACACAG AACAACCAAA TCGAGCCGCT TGCTCGTCGC
ATTCATACTT TGCGGCGCCA GCGGGGAAGC GCGCTGAAAA TTGTCGTGCG CGAAAATATC
GCCAGTTTGC GCGCCACCGA TGAGCGCCTG CTGCTGGGCT GCGGCGCGAA TATGATCATT
CCCTGGAACG CCCCGCTTTC ACGCTGCCTG ACGCTTATTG AAAGCGTGCA GGGGCAGCAG
TTCAGCCGTT ACGTACCGGA AGACATCACC ACGCTACTGT CGATGACGCA GCCGTTGAAA
CTGCGCGGTT TTCAGCCGTG GGATACCTTC TGCGATGCCA TCCATACGAT GATGAGCAAC
ACCCTGCTCC CCGCCGACGG GAAAGGCGTT CTGGTCGCGC TGCGCCCGGT GCCGGGCATT
CGGGTTGAGC AGGCATTAAC ATTATGTCGG CCAAACCGAA CCGGCGATAT TATGACCATC
GGCGGCAACC GTCTGGTGCT GTTTTTATCA TTCTGCCGGG TCAACGATCT GGATACCGCG
TTAAACCATA TTTTCCCTTT GCCGACGGGC GATATTTTCT CTAATCGTAT GGTCTGGTTC
GAAGATAAAC AAATCAGCGC CGAGCTGGTG CAGATGCGCT TATTGTCGCC GGAACTGTGG
GGAACGCCGC TACCGCTGGC AAAACGCGCC GACCCGGTAA TAAATGCCGA ACACGATGGC
CGCATCTGGC GTCGTATTCC TGAACCCCTG CGACTGCTCG ACGACACCGC GGAGCGTGCA
TCATGA
 
Protein sequence
MPTGGVWWVN ADRQQDAISL VNQTIASQTE NANVAVIGME GDPGKVIKLD ESHGPEKIRL 
FTMPDSEKGL YSLPHDLLCS VNPTHYFFIL ICANNTWRNI TSESLHKWLE KMNKWTRFHH
CSLLVINPCN NSDKQSSLLM GEYRSLFGLA SLRFQGDQHL FDIAFWCNEK GVSARQQLLL
CQQDERWTLS HQEETAIQPR SDEKRILSHV AVLEGAPPLS EHWTLFDNNE ALFNDARTAQ
AATIIFSLTQ NNQIEPLARR IHTLRRQRGS ALKIVVRENI ASLRATDERL LLGCGANMII
PWNAPLSRCL TLIESVQGQQ FSRYVPEDIT TLLSMTQPLK LRGFQPWDTF CDAIHTMMSN
TLLPADGKGV LVALRPVPGI RVEQALTLCR PNRTGDIMTI GGNRLVLFLS FCRVNDLDTA
LNHIFPLPTG DIFSNRMVWF EDKQISAELV QMRLLSPELW GTPLPLAKRA DPVINAEHDG
RIWRRIPEPL RLLDDTAERA S
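
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below is a minimal, dependency-free illustration, not the pipeline used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in permitted start codons, which this sketch ignores). The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Codon -> amino acid map, built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used on this page) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above.
first_line = "ATGCCAACCG GCGGCGTCTG GTGGGTTAAC GCCGATCGCC AGCAAGATGC CATCAGTCTG"
print(translate(first_line))  # MPTGGVWWVNADRQQDAISL
```

The output matches the first 20 residues of the protein sequence (MPTGGVWWVN ADRQQDAISL). Likewise, the final six bases TCATGA translate to a serine followed by a stop codon, consistent with the protein ending in S.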