Gene SNSL254_A3890 details

Gene Information

Locus tag: SNSL254_A3890
Symbol: bcsB
ID: 6484189
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 3773084
End bp: 3775363
Gene Length: 2280 bp
Protein Length: 759 aa
Translation table: 11
GC content: 57%
IMG OID: 642739154
Product: cellulose synthase regulator protein
Protein accession: YP_002042865
Protein GI: 194443536
COG category:
COG ID:
TIGRFAM ID:
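The start, end, and length fields in this record are related by simple arithmetic. The sketch below shows that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3773084
END_BP = 3775363


def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2280
print(protein_length(length_bp))  # 759
```

Both results agree with the Gene Length and Protein Length values listed in the record.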


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGTGCGG CGGTAATAGG ATTAAGCGCG TTTCCTGCTT TCATGACGGC GGCGGCGCCT 
GCGATGCCGC CATTGATCAA TGCTGAACCC ACCGAGCCTG CGCAGTCGCC CGCAGATCAG
GCGCCAGTCG TGGCGCAGAC CGCGCCTTCG CGCGAGGTCA AGCTGACCTT TGCGCAAATC
GCGCCGCCGC CGGGTAGTAT GGCGCTGCGT GGCGTTAACC CTAACGGCGG CATTGAATTT
GGTATGCGCA GCGATGAAGT GGCGTCGAAA GCGGTGCTGA ATCTGGAATA TACGCCCTCG
CCGTCGCTCC TGCCGGTTCA GTCGCAGCTC AAGGTTTATC TCAATGATGA ACTGATGGGC
GTACTGCCGG TGACAAAAGA GCAGTTGGGG AAAAAGACGC TGGCGCAGGT ACCTATCAAT
CCGCTATTTA TCACCGACTT TAACCGGGTG CGGCTGGAGT TTGTCGGCCA CTATCGCGAC
GTGTGTGAAA ACCCGGCCAG CAGTACTCTG TGGTTAGACA TCGGGCGAAA TAGCGCCCTG
GATCTGACCT ATAACATGCT GGCGGTGAAT AACGATCTGT CCCACTTCCC GGTGCCGTTT
TTCGATCCGC GGGATAACCG TCCGGTGACG TTGCCGATAG TGTTTGCTGA CATGCCGGAT
CTGGCGCAGC AGCAGGCGGC TTCTATTGTC GCGTCCTGGT TTGGCTCGCG GGCGGGCTGG
CGCGGTCAGC GCTTCCCGGT GTTGTATAAT CACCTGCCGG ATCGCAATGC AATCGTGTTC
GCCACCAACG ATCGACGCCC CGATTTCCTG CGCGATCATC CTGCGGTTAA CGCGCCGGTT
ATCGAGATGA TGAGCCATCC GGATAATCCG TATGTGAAGT TGCTGGTCGT GTTTGGTCGA
GATGATAAAG ACCTGTTGCA GGCGGCAAAA GGTATCGCGC AAGGGAATAT TCTCTTCCGT
GGTTCCAGCG TGGTGGTCAA CGATGTAAAA CCTCTGCTGG CGCGCAAACC GTATGATGCG
CCGAACTGGG TGCGTACCGA TCGCCCGGTC ACTTTTGGCG AGCTGAAAAC CTATGAAGAG
CAGCTCCAGT CGAGTGGGCT GGAGCCGGCG CCCATCAACG TTTCTTTGAA TCTGCCGCCG
GACCTCTATT TGCTGCGTAG CAACGGTATT GATATGGATC TCAACTACCG TTATACCTCG
CCGCCGATCA AAGACAGTTC ACGGCTGGAT ATCAGTCTGA ATAACCAGTT CCTGCAAGCC
TTTAGCCTTA ACAGCACGCA GGAAACTAAT CGACTCCTGT TGCGCCTGCC GGTACTTCAG
GGACTGCTGG ATGGTAAAAC GGATGTGTCT ATTCCGGCGC TCAAACTGGG GGCGATGAAC
CAACTACGTT TTGACTTCCG CTACATGAAT CCGATGCCGG GCGGGTCGGT GGACAACTGT
ATTACCTTCC AGCCGGTACC GAATCATGTG GTGATAGGGG ATGACTCCAC TATCGATTTT
TCGAAATATT ACCACTTTAT CGCGATGCCG GATTTACGCG CGTTCGCCAA TGCGGGTTTC
CCGTTCAGCC GGATGGCCGA CTTGTCTGAC ACGCTGGCGG TGATGCCGAA GACCCCAACC
GAAGCGCAAA TGGAAACGCT GCTGAATACG GTCGGCGCCA TTGGCGGGCA GACCGGTTTC
CCGGCAATTA ATCTGACAAT CACCGATGAT AGCGCTCAGA TAGCCGACAA AGACGCCGAT
CTGCTGATTA TTGGCGCTAT TCCGGGCAAG CTAAAAGACG ATAAGCGTAT CGATCTGTTG
GTGCAGGCGA CACAAAGCTG GGTAAAAACC CCGATGCGGC AGACCGCTTT CCCGTCGATT
ATGCCGGATG AGGCCGATCG CGCGGCGGAT GCACAGTCCA CCGTCACCGC CAGCGGCCCG
ATGGCGGCGG TGGTGGGCTT CCAGTCGCCG TTTAATGATC AGCGCAGCGT GATTGCTCTG
CTGGCTGACA GCCCGCGCGG CTACCAGCTA CTGAACGACG CCGTGAACGA CAGCGGTAAA
CGCGCCGCGA TGTTTGGTTC CGTGGCGGTG ATCCGCGAGT CCGGCGTTCA CAGTCTGCGC
GTTGGCGATA TCTATTACGT CGGACATCTG CCGTGGTTTG AGCGGCTGTG GTATGCGCTG
GCGAATCACC CGGTGCTGCT GGCGGTACTG GCGGCCCTCA GTGTGGTATT ACTGGCGTGG
GTATTGTGGC GTCTGCTACG TATTCTCAGT CGCCGTCGTC TCGACCCTGA CCATGAGTAA
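
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be cleaned before any computation such as a GC fraction. A minimal sketch follows. The helper names are hypothetical, and only the first line of the sequence is used as demo input, so the printed percentage is for that fragment and not for the whole gene.

```python
def clean_sequence(text):
    """Drop spaces and newlines from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases among all characters of seq."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used only as a short demo input.
first_line = "ATGTGTGCGG CGGTAATAGG ATTAAGCGCG TTTCCTGCTT TCATGACGGC GGCGGCGCCT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Passing the full gene sequence through the same two functions would give a value to compare against the GC content field of the record.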
 
Protein sequence
MCAAVIGLSA FPAFMTAAAP AMPPLINAEP TEPAQSPADQ APVVAQTAPS REVKLTFAQI 
APPPGSMALR GVNPNGGIEF GMRSDEVASK AVLNLEYTPS PSLLPVQSQL KVYLNDELMG
VLPVTKEQLG KKTLAQVPIN PLFITDFNRV RLEFVGHYRD VCENPASSTL WLDIGRNSAL
DLTYNMLAVN NDLSHFPVPF FDPRDNRPVT LPIVFADMPD LAQQQAASIV ASWFGSRAGW
RGQRFPVLYN HLPDRNAIVF ATNDRRPDFL RDHPAVNAPV IEMMSHPDNP YVKLLVVFGR
DDKDLLQAAK GIAQGNILFR GSSVVVNDVK PLLARKPYDA PNWVRTDRPV TFGELKTYEE
QLQSSGLEPA PINVSLNLPP DLYLLRSNGI DMDLNYRYTS PPIKDSSRLD ISLNNQFLQA
FSLNSTQETN RLLLRLPVLQ GLLDGKTDVS IPALKLGAMN QLRFDFRYMN PMPGGSVDNC
ITFQPVPNHV VIGDDSTIDF SKYYHFIAMP DLRAFANAGF PFSRMADLSD TLAVMPKTPT
EAQMETLLNT VGAIGGQTGF PAINLTITDD SAQIADKDAD LLIIGAIPGK LKDDKRIDLL
VQATQSWVKT PMRQTAFPSI MPDEADRAAD AQSTVTASGP MAAVVGFQSP FNDQRSVIAL
LADSPRGYQL LNDAVNDSGK RAAMFGSVAV IRESGVHSLR VGDIYYVGHL PWFERLWYAL
ANHPVLLAVL AALSVVLLAW VLWRLLRILS RRRLDPDHE
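
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). `CODON_TABLE` and `translate` are hypothetical names, and the inputs are short fragments copied from the start and end of the gene sequence.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the gene sequence.
head = "ATGTGTGCGG CGGTAATAGG ATTAAGCGCG TTTCCTGCTT TCATGACGGC GGCGGCGCCT"
print(translate(head))         # MCAAVIGLSAFPAFMTAAAP
# Last 9 bases of the gene sequence, ending in the TAA stop codon.
print(translate("CATGAGTAA"))  # HE
```

The two outputs match the first twenty residues and the final two residues of the protein sequence listed above.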