Gene SeD_A3999 details


Gene Information

Locus tag: SeD_A3999
Symbol:
ID: 6871501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3844451
End bp: 3846010
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 53%
IMG OID: 642786955
Product: hypothetical protein
Protein accession: YP_002217583
Protein GI: 198245216
COG category:
COG ID:
TIGRFAM ID: [TIGR03369] cellulose biosynthesis protein BcsE
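
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(3844451, 3846010)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.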


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.366924
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACCCCG TATTTTCTCT CGGCATCTCA TCATTATGGG ATGAACTGCG CCATATGCCA 
ACCGGCGGCG TCTGGTGGGT TAACGCCGAT CGCCAGCAAG ATGCCATCAG CCTGGTGAAT
CAAACGATTG CGTCACAAAC GGAGAATGCA AATGTCGCCG TCATCGGCAT GGAAGGCGAT
CCTGGCAAAG TAATCAAATT AGATGAATCT CACGGTCCGG AGAAAATCCG CTTATTTACC
ATGCCGGATT CAGAAAAAGG GCTATACTCT TTGCCCCACG ATTTGCTTTG TTCTGTTAAC
CCGACGCATT ACTTTTTCAT TCTTATTTGT GCAAATAACA CGTGGCGGAA TATAACGTCA
GAAAGCCTGC ATAAATGGCT GGAAAAAATG AATAAATGGA CTCGTTTTCA TCACTGTTCA
TTGTTGGTTA TTAACCCTTG TAATAATAGC GATAAACAGT CCTCGTTGTT GATGGGCGAG
TATCGCTCAC TTTTCGGCCT CGCCAGTTTA CGTTTTCAGG GCGACCAACA TTTGTTCGAT
ATTGCCTTCT GGTGTAACGA AAAAGGCGTC AGCGCCCGAC AGCAGTTATT GCTGTGTCAG
CAGGACGAAC GCTGGACGCT ATCCCATCAG GAGGAGACGG CAATTCAGCC GCGTAGCGAC
GAAAAACGCA TTCTTAGCCA CGTCGCCGTC CTTGAAGGCG CGCCGCCGCT CTCGGAACAC
TGGACGCTTT TCGACAATAA CGAAGCGCTA TTCAACGACG CGCGCACGGC GCAGGCCGCG
ACAATTATTT TTTCGCTTAC ACAGAACAAC CAAATCGAGC CGCTTGCTCG TCGCATTCAT
ACTTTGCGGC GCCAGCGGGG AAGCGCGCTG AAAATTGTCG TGCGCGAAAA TATCGCCAGT
TTGCGCGCCA CCGATGAGCG CCTGCTGCTG GGCTGCGGCG CGAATATGAT CATTCCCTGG
AACGCCCCGC TTTCACGCTG CCTGACGCTT ATTGAAAGCG TGCAGGGACA GCAGTTCAGC
CGTTACGTAC CGGAAGACAT CACCACGCTA CTGTCAATGA CGCAGCCGTT GAAACTGCGC
GGTTTTCAGC CGTGGGATAT CTTCTGCGAT GCCATCCATA CGATGATGAG CAACACCCTG
CTCCCCGCCG ACGGGAAAGG CGTTCTGGTC GCGCTGCGCC CGGTGCCGGG CATTCGGGTT
GAGCAGGCGT TAACATTATG TCGGCCAAAC CGAACCGGCG ATATTATGAC CATCGGCGGC
AACCGTCTGG TGCTGTTTTT ATCATTCTGC CGGGTCAACG ATCTGGATAC CGCGTTAAAC
CATATTTTCC CTTTGCCGAC GGGCGATATT TTCTCTAATC GTATGGTCTG GTTCGAAGAT
AAACAAATCA GCGCCGAGCT GGTGCAGATG CGCTTATTGT CGCCGGAACT GTGGGGAACG
CCGCTACCGC TGGCAAAACG CGCCGACCCG GTAATAAATG CCGAACACGA TGGCCGCATC
TGGCGTCGTA TTCCTGAACC CCTGCGATTG CTCGACGACA CCGCGGAGCG TGCATCATGA
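
The gene sequence is laid out in blocks of 10 bases, 60 per line, so the whitespace has to be stripped before any calculation. As a minimal sketch, the helper below (a hypothetical name, not part of any library) removes the whitespace and computes the G+C fraction. It is shown on the first line of the sequence only, so the result is not the gene-wide GC content field.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, in its 10-base block layout.
first_line = (
    "GTGGACCCCG TATTTTCTCT CGGCATCTCA "
    "TCATTATGGG ATGAACTGCG CCATATGCCA"
)
print(f"{gc_fraction(first_line):.2f}")
```

Passing the full 1560 bp sequence to the same function would give a figure to compare with the record's GC content field.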
 
Protein sequence
MDPVFSLGIS SLWDELRHMP TGGVWWVNAD RQQDAISLVN QTIASQTENA NVAVIGMEGD 
PGKVIKLDES HGPEKIRLFT MPDSEKGLYS LPHDLLCSVN PTHYFFILIC ANNTWRNITS
ESLHKWLEKM NKWTRFHHCS LLVINPCNNS DKQSSLLMGE YRSLFGLASL RFQGDQHLFD
IAFWCNEKGV SARQQLLLCQ QDERWTLSHQ EETAIQPRSD EKRILSHVAV LEGAPPLSEH
WTLFDNNEAL FNDARTAQAA TIIFSLTQNN QIEPLARRIH TLRRQRGSAL KIVVRENIAS
LRATDERLLL GCGANMIIPW NAPLSRCLTL IESVQGQQFS RYVPEDITTL LSMTQPLKLR
GFQPWDIFCD AIHTMMSNTL LPADGKGVLV ALRPVPGIRV EQALTLCRPN RTGDIMTIGG
NRLVLFLSFC RVNDLDTALN HIFPLPTGDI FSNRMVWFED KQISAELVQM RLLSPELWGT
PLPLAKRADP VINAEHDGRI WRRIPEPLRL LDDTAERAS
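
The gene sequence starts with GTG while the protein starts with M. This is consistent with translation table 11, where GTG is one of the alternative initiation codons and is read as methionine at the start of a coding sequence. The sketch below illustrates this with a hand-built codon table. The names `CODON_TABLE`, `START_CODONS` and `translate` are illustrative, and the start-codon set is the one listed for NCBI translation table 11.

```python
from itertools import product

# Standard codon assignments, bases ordered T, C, A, G at each position.
# Table 11 uses the same assignments as the standard code; it differs
# in which codons may act as initiators.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Initiation codons of translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Whitespace is ignored. A recognised initiation codon in the first
    position is translated as M.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence above.
print(translate("GTGGACCCCG TATTTTCTCT CGGCATCTCA "
                "TCATTATGGG ATGAACTGCG CCATATGCCA"))
```

Applied to the first 60 bases, this yields the first 20 residues of the protein sequence listed above.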