Gene SeHA_C3932 details

Gene Information

Locus tag: SeHA_C3932
Symbol:
ID: 6491469
Type: CDS
Is gene spliced: No
Is pseudogene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start (bp): 3807011
End (bp): 3810463
Gene length: 3453 bp
Protein length: 1150 aa
Translation table: 11
GC content: 60%
IMG OID: 642744038
Product: cellulose synthase subunit BcsC
Protein accession: YP_002047644
Protein GI: 194451942
COG category:
COG ID:
TIGRFAM ID:
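
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(3807011, 3810463)
print(length_bp)                  # 3453
print(protein_length(length_bp))  # 1150
```

Under these assumptions, both results agree with the Gene length and Protein length fields of the record.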


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.903843
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 85
Fosmid unclonability p-value:
Fosmid hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGTTGGGTT TATCGCTTGG TATGGCGCTA ACACCGCTTG CCGGCGCAGC GACCTCCGCG 
CAGCAACAGT TGCTGGAGCA GGTTCGGCTG GGCGAGGCCA CGCACCGTGA GGACTTAGTC
CGTCAGTCGC TCTATCGCCT GGAGCTGATC GACCCCAATG ATCCACAGGT TATCGCCGCC
CGTTTCCGCT ATCTGTTGCG GCAGGGGGAT AGCGACGGGG CGCAGAAGTT ACTTGACCGG
CTGGCGCAAC TGGCGCCGGA GTCGACGGCG TATCAATCTT CCCGCACCGC GATGCTGCTC
TCCACGCCGC AAGGACGCCA GTCTTTGCAG GAGGCGCGTT TACTGGCGAC GACCGGCCAT
ACTGAACAAG CGATCGCCAG CTACGACAAG CTGTTTAAAG GTTATCCGCC GGAGGGCGAA
CTGGCGGTCG AATACTGGAC GACCGTGGCG AAATTGCCCG CCCGCCGTCA CGAAGCGATT
AACCAGCTAC AGAAAATCAA TGCCGTCAGT CCGGGTAATA ACGCTCTGCA AAATGCGCTG
GCGCAACTGT TGTTCGCCAG CGGGCGGCGC GATGAGGGAT TCGCGGTGCT AAAACAGATG
GCGAAATCCA GTACGGGACG CAGCGCGGCC TCCGCCATCT GGTACCAGCA GATAAAAGAT
CTCCCGGTGA GCGACGCCAG CGTAAAAGCG TTGCAAGACT ATCTGACGCA GTTTAGCGAA
GGCGATAGCG TGTCTGCCGC CCGCGCCCAG CTTAGCGAGC AGCAAAAACA GTTAGCCGAT
CCAGCGTTCC GCGCGCGCTC GCAGGGCATC GCGGCGGTTA ATGCCGGAGA AGGTGGTAAG
GCTATTGTGC AATTGCAGCA GGCGGTGAGC GCCCGGCAGG ACGACAGCGA GGCGGTCGGC
GCGCTGGGGC AGGCATACTC ACAGAGTGGC GATCGCGCCC GCGCCGTCGC GCAGTTTGAA
AAGGCGCTGG CGATGGCGCC GCACAGCAGC AGCCGCGATA AGTGGGAGAG TCTGCTGAAG
GTCAATCGCT ACTGGCTGTT AATTCAGCAG GGCGACGCGG CCTTAAAAGC GAATAATCTG
GCCCAGGCGG AGCGTTTCTA TCAGCAGGCG CGAGCAGTGG ATAACACCGA CAGCTACGCG
GTGCTGGGGC TGGGGGATGT GGCGATGGCG CGCAAAGATA ATGCCGCCGC CGAACGTTAT
TATCAGCAGA CGCTGCGTAT GGACAGCGGT AATACCAATG CTGTACGCGG GCTGGCGAAT
CTTTATCGCC AGCAGTCGCC GCAAAAAGCC GCCGCGTTTA TCGCTTCTCT TTCCGCCAGC
CAGCGGCGCA GTATCGACGA TATCGAACGC AGTCTGGAAA ATGACCGTCT GGCGCAGCAG
GCGGAAACGC TGGAAAGCGA GGGCAAATGG GCGCAGGCCG CAGAACTGCA CCGTCGTCGG
CTGGCATTAG ATCCGGGGAG CGTGTGGGTA ACGTACCGAC TGTCACGCGA TCTGTGGCAG
GCCGGGCAGC ACGCCCAGGC CGATGCGCAA ATGCGCTCTC TGGCGCAGCA GAAGCCAAAC
GATCCGGAAC AGGTCTATGC TTATGGGCTT TATCTTTCCG GCAGCGATCG GGACCGGGCG
GCGCTGGCGC ATCTCAATAC CCTACCGACC AGCCAGTGGA ACAGCAATAT TCAGGAACTG
GCGGGCCGAT TGCAAAGTAA CCAGGTGCTG GAAAGCGCTA ACCGCTTGCG CGATAGCGGC
AAAGAACGCG AAGCGGAAGC GTTGTTACGT CAGCAGCCGC CCTCTACGCG CATTGCGTTA
ACGTTGGCGG ACTGGGCGCA GCAGCGTGGC GATAATGCGG CGGCCCGCGC CGCTTATGAC
GCCGTTCTGG CGCGGGAACC GGGTAATGTC GATGCCATGC TGGGGCGGGT GGAAATCGAC
ATCGCACAGG GCGATAACGC TGCGGCGCGC GCTCAACTGG CGGCGCTGCC TGCGTCGCAA
ATCACCTCTC TTAACATGCA GCGCCGCGTC GCGCTGGCGC AGCTCCAGCT TGGCGATATC
ACGGCGGCGG CGCGGATCTT TAACCGCATT ACGCCGCAGG CAAAAGCACA GCCGCCATCA
ATGGAAAGCG CAATGGTATT ACGTGACGCC GCCGCTTTTC AGGCGCAAAC GGGCGAGCCG
CAGCGGGCGC TGGAGACCTA CAAAGAGGCA ATGGTCGCCG CGGCGATTAC GCCGGTTCGT
CCCCAGGATA ACGATACCTT TACCCGCCTG ACGCGCAATG ATGAAAAAGA TGACTGGCTA
AAACGCGGCG TGCGTAGCGA TGCGGCGGAG TTGTACCGTC AGCAGGATCT CAATGTCACG
TTGGCGCACG ATTATTGGGG GTCGAGCGGA ACTGGCGGCT ACTCCGATCT GAAGGCACAT
ACCACGATGC TTCAGGTGGA TGCGCCCTGG TCGGACGGAC GGGCGTTCTT TCGTACTGAT
ATGGTGAATA TGGATGTTGG CCGCTTCTCT ACGGATGCGG ATGGAAAATA CGATAATAAC
TGGGGTACCT GTACGCTGGA GAAATGCAGC GGACATCGTA GCCAGGCCGA TACGGGCGCG
AGCGTGGCGG TCGGCTGGCA GAATGAGACC TGGCGCTGGG ATATCGGCAC GACGCCGATG
GGCTTTAATG TCGTTGATGT GGTCGGCGGC GTCAGCTATA GCGACGATAT CGGGCCGTTG
GGTTATACCC TGAACGCGCA TCGTCGCCCG ATCTCCAGCT CGCTGCTGGC GTTTGGCGGG
CAAAAGGATG CCAGCAGCAA TACCGGCACC AAATGGGGCG GCGTGCGGGC CAACGGCGGC
GGCGTCAGTC TCAGCTATGA TAAAGGCGAA GCAAACGGTG TCTGGGCGTC GCTCAGCGGC
GACCAGTTGA GCGGTAAAAA TGTGGAAGAT AACTGGCGCG TGCGCTGGAT GACCGGTTAT
TACTATAAGG TGATTAACGA GAATAACCGC CGCGTTACCG TCGGGCTGAA TAACATGATC
TGGCATTACG ACAAAGATCT GAGCGGTTAT TCACTGGGTC AGGGCGGTTA TTATAGCCCG
CAGGAATACC TGTCGTTTGC GGTGCCGGTG ATGTGGCGGC AGCGTACGGA AAACTGGTCG
TGGGAGCTAG GCGGCTCGGT ATCCTGGTCG CACTCCCGCA CCCGTACCAT GCCGCGTTAT
CCGCTGATGA ATTTGATCCC GGCGGATTAT CAGGAGGATG CGCGTGACCA GACCAACGTC
GGCGGCAGCA GTCAGGGATT TGGCTATACC GCGCGGGCGC TCATTGAACG CCGGGTCACT
GCCAACTGGT TTGTGGGTAC GGCTGTCGAT ATTCAGCAGG CGAAAGACTA TACCCCCAGT
CATCTGCTGC TGTATGTCCG TTATTCCGCA GCGGGCTGGC AGGGGGATAT GGATTTACCG
CCGCAGCCTC TGGTGCCTTA CGCTGACTGG TAA
 
Protein sequence
MLGLSLGMAL TPLAGAATSA QQQLLEQVRL GEATHREDLV RQSLYRLELI DPNDPQVIAA 
RFRYLLRQGD SDGAQKLLDR LAQLAPESTA YQSSRTAMLL STPQGRQSLQ EARLLATTGH
TEQAIASYDK LFKGYPPEGE LAVEYWTTVA KLPARRHEAI NQLQKINAVS PGNNALQNAL
AQLLFASGRR DEGFAVLKQM AKSSTGRSAA SAIWYQQIKD LPVSDASVKA LQDYLTQFSE
GDSVSAARAQ LSEQQKQLAD PAFRARSQGI AAVNAGEGGK AIVQLQQAVS ARQDDSEAVG
ALGQAYSQSG DRARAVAQFE KALAMAPHSS SRDKWESLLK VNRYWLLIQQ GDAALKANNL
AQAERFYQQA RAVDNTDSYA VLGLGDVAMA RKDNAAAERY YQQTLRMDSG NTNAVRGLAN
LYRQQSPQKA AAFIASLSAS QRRSIDDIER SLENDRLAQQ AETLESEGKW AQAAELHRRR
LALDPGSVWV TYRLSRDLWQ AGQHAQADAQ MRSLAQQKPN DPEQVYAYGL YLSGSDRDRA
ALAHLNTLPT SQWNSNIQEL AGRLQSNQVL ESANRLRDSG KEREAEALLR QQPPSTRIAL
TLADWAQQRG DNAAARAAYD AVLAREPGNV DAMLGRVEID IAQGDNAAAR AQLAALPASQ
ITSLNMQRRV ALAQLQLGDI TAAARIFNRI TPQAKAQPPS MESAMVLRDA AAFQAQTGEP
QRALETYKEA MVAAAITPVR PQDNDTFTRL TRNDEKDDWL KRGVRSDAAE LYRQQDLNVT
LAHDYWGSSG TGGYSDLKAH TTMLQVDAPW SDGRAFFRTD MVNMDVGRFS TDADGKYDNN
WGTCTLEKCS GHRSQADTGA SVAVGWQNET WRWDIGTTPM GFNVVDVVGG VSYSDDIGPL
GYTLNAHRRP ISSSLLAFGG QKDASSNTGT KWGGVRANGG GVSLSYDKGE ANGVWASLSG
DQLSGKNVED NWRVRWMTGY YYKVINENNR RVTVGLNNMI WHYDKDLSGY SLGQGGYYSP
QEYLSFAVPV MWRQRTENWS WELGGSVSWS HSRTRTMPRY PLMNLIPADY QEDARDQTNV
GGSSQGFGYT ARALIERRVT ANWFVGTAVD IQQAKDYTPS HLLLYVRYSA AGWQGDMDLP
PQPLVPYADW
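
The gene sequence above begins with TTG, while the protein sequence begins with M. This is consistent with translation table 11, in which TTG is one of several alternative initiation codons that are read as methionine at the first position. The following is a minimal sketch of such a translation, not the database's own method. The codon assignments and the start-codon set follow the NCBI definition of table 11; the function name `translate_cds` is hypothetical, and the input is assumed to be a complete, in-frame coding sequence.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G, with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, reading an alternative start codon as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks of the listing
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        residue = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            residue = "M"  # the initiator codon is translated as methionine
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the gene sequence listed above.
print(translate_cds("TTGTTGGGTT TATCGCTTGG TATGGCGCTA"))  # MLGLSLGMAL
```

The output matches the first ten residues of the protein sequence listed above. Note that the second TTG codon is read as leucine, since the methionine substitution applies only to the initiator position.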