Gene SeHA_C3939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C3939 
Symbol 
ID6490839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp3817793 
End bp3819352 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content53% 
IMG OID642744045 
Productputative cellulose biosynthesis protein BcsE 
Protein accessionYP_002047651 
Protein GI194451919 
COG category 
COG ID 
TIGRFAM ID[TIGR03369] cellulose biosynthesis protein BcsE 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.636302 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACCCCG TATTTTCTCT CGGCATCTCA TCATTATGGG ATGAACTGCG CCATATGCCA 
ACCGGCGGCG TCTGGTGGGT TAACGCCGAT CGCCAGCAAG ATGCCATCAG CCTGGTGAAT
CAAACGATTG CGTCACAAAC GGAGAATGCA AATGTCGCCG TCATCGGCAT GGAAGGCGAT
CCTGGCAAAG TAATCAAATT AGATGAATCT CACGGTCCGG AGAAAATCCG CTTATTTACC
ATGCCGGATT CAGAAAAAGG GCTATACTCT TTGCCCCACG ATTTGCTTTG TTCTGTTAAC
CCGACGCATT ACTTTTTCAT TCTTATTTGT GCAAATAACA CGTGGCGGAA TATAACGTCA
GAAAGCCTGC ATAAATGGCT GGAAAAAATG AATAAATGGA CTCGCTTTCA TCGCTGTTCA
TTGTTGGTTA TTAACCCTTG TAATAATAGC GATAAACAGT CCTCGTTGTT GATGGGCGAG
TATCGCTCAC TTTTCGGCCT CGCCAGTTTA CGTTTTCAGG GCGACCAACA TTTGTTCGAT
ATTGCCTTCT GGTGTAACGA AAAAGGCGTC AGCGCCCGAC AGCAGTTATT GCTGTGTCAG
CAGGACGAAC GCTGGACGCT ATCCCATCAG GAGGAGACGG CAATTCAGCC GCGTAGCGAC
GAAAAACGCA TTCTTAGCCA CGTCGCCGTC CTTGAAGGCG CGCCGCCGCT CTCGGAACAC
TGGACGCTTT TCGACAATAA CGAAGCGCTA TTCAACGACG CGCGCACGGC GCAGGCCGCG
ACAATTATTT TTTCGCTTAC ACAGAACAAC CAAATCGAGC CGCTTGCTCG TCGCATTCAT
ACTTTGCGGC GCCAGCGGGG AAGCGCGCTG AAAATTGTCG TGCGCGAAAA TATCGCCAGT
TTGCGCGCCA CCGATGAGCG CCTGCTGCTG GGCTGCGGCG CGAATATGAT CATTCCCTGG
AACGCCCCGC TTTCACGCTG CCTGACGCTT ATTGAAAGCG TGCAGGGGCA GCAGTTCAGC
CGTTACGTAC CGGAAGACAT CACCACGCTA CTGTCGATGA CGCAGCCGTT GAAACTGCGC
GGTTTTCAGC CGTGGGATAC CTTCTGCGAT GCCATCCATA CGATGATGAG CAACACCCTG
CTCCCCGCCG ACGGGAAAGG CGTTCTGGTC GCGCTGCGCC CGGTGCCGGG CATTCGGGTT
GAGCAGGCGT TGACATTATG TCGACCAAAC CGAACCGGCG ATATTATGAC CATCGGCGGC
AACCGTCTGG TGCTGTTTTT ATCATTCTGC CGGGTCAACG ATCTGGATAC CGCGTTAAAC
CATATTTTCC CTTTGCCGAC GGGCGATATT TTCTCTAATC GTATGGTCTG GTTCGAAGAT
AAACAAATCA GCGCCGAGCT GGTGCAGATG CGCTTATTGT CGCCGGAACT GTGGGGAACG
CCGCTACCGC TGGCAAAACG CGCCGACCCG GTAATAAACG CCGAACACGA TGGCCGCATC
TGGCGTCGTA TTCCTGAACC CCTGCGACTG CTCGACGACA CCGCGGAGCG TGCATCATGA
 
Protein sequence
MDPVFSLGIS SLWDELRHMP TGGVWWVNAD RQQDAISLVN QTIASQTENA NVAVIGMEGD 
PGKVIKLDES HGPEKIRLFT MPDSEKGLYS LPHDLLCSVN PTHYFFILIC ANNTWRNITS
ESLHKWLEKM NKWTRFHRCS LLVINPCNNS DKQSSLLMGE YRSLFGLASL RFQGDQHLFD
IAFWCNEKGV SARQQLLLCQ QDERWTLSHQ EETAIQPRSD EKRILSHVAV LEGAPPLSEH
WTLFDNNEAL FNDARTAQAA TIIFSLTQNN QIEPLARRIH TLRRQRGSAL KIVVRENIAS
LRATDERLLL GCGANMIIPW NAPLSRCLTL IESVQGQQFS RYVPEDITTL LSMTQPLKLR
GFQPWDTFCD AIHTMMSNTL LPADGKGVLV ALRPVPGIRV EQALTLCRPN RTGDIMTIGG
NRLVLFLSFC RVNDLDTALN HIFPLPTGDI FSNRMVWFED KQISAELVQM RLLSPELWGT
PLPLAKRADP VINAEHDGRI WRRIPEPLRL LDDTAERAS