Gene EcE24377A_4018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4018 
SymbolbcsC 
ID5590269 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp3997563 
End bp4000985 
Gene Length3423 bp 
Protein Length1140 aa 
Translation table11 
GC content58% 
IMG OID640927639 
Productcellulose synthase subunit BcsC 
Protein accessionYP_001465000 
Protein GI157154865 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGATGG TCGAGGCAGC ACCAACCGCT CAGCAACAGT TGCTGGAGCA AGTTCGGTTA 
GGCGAAGCGA CCCATCGTGA AGATCTGGTG CAACAGTCGT TATATCGGCT GGAACTTATT
GATCCGAATA ACCCGGACGT CGTTGCCGCC CGTTTCCGTT CTTTGTTACG TCAGGGCGAT
ATTGATGGCG CGCAAAAACA GCTCGATCGG CTGTCGCAGT TAGCGCCGAG TTCAAATGCG
TATAAATCGT CGCGGACTAC GATGCTACTT TCCACGCCGG ATGGTCGTCA GGCACTGCAA
CAGGCACGAT TGCAGGCGAC GACCGGTCAT GCAGAAGAAG CTGTGGCGAG TTACAACAAA
CTGTTCAACG GTGCGCCGCC GGAAGGTGAC ATTGCTGTCG AGTACTGGAG TACGGTGGCG
AAAATTCCGG CTCGCCGTGG CGAAGCGATT AATCAGTTAA AACGCATCAA TGCGGATGCA
CCGGGCAATA CGGGCCTGCA AAACAATCTG GCGCTATTGC TGTTTAGTAG CGATCGCCGT
GACGAAGGTT TTGCCGTCCT GGAACAGATG GCAAAATCGA ACGCCGGGCG CGAAGGGGCC
TCTAAAATCT GGTACGGGCA GATTAAAGAC ATGCCCGTCA GTGATGCCAG TGTGTCGGCG
CTGAAAAAAT ATCTCTCGAT CTTTAGTGAT GGCGATAGCG TGGCGGCTGC GCAATCGCAA
CTGGCAGAAC AGCAAAAACA GCTGGCCGAT CCTGCTTTCC GCGCTCGTGC GCAAGGTTTA
GCGGCGGTGG ACTCTGGTAT GGCGGGTAAA GCCATTCCCG AACTACAACA GGCGGTGCGG
GCGAACCCGA AAGACAGTGA AGCTCTGGGG GCGCTGGGCC AGGCGTATTC TCAGAAAGGC
GATCGCGCCA ATGCAGTGGC GAATCTGGAA AAAGCCCTCG CACTGGACCC GCACAGCAGC
AACAACGACA AATGGAACAG TCTGCTGAAA GTAAACCGCT ACTGGCTGGC GATCCAGCAG
GGCGATGCTG CGCTGAAAGC CAATAATCCT GACCGGGCAG AACGCCTGTT CCAGCAGGCG
CGTAATGTCG ATAACACCGA CAGTTATGCA GTGCTGGGGC TGGGCGATGT GGCGATGGCG
CGAAAAGATT ATCCCGCCGC CGAACGTTAT TATCAGCAGA CCTTGCGTAT GGACAGCGGC
AACACTAACG CCGTGCGCGG GCTGGCAAAT ATTTACCGCC AGCAATCGCC AGAAAAAGCT
GAAGCGTTTA TCGCCTCGCT CTCTGCCAGT CAGCGGCGTA GCATTGATGA TATCGAACGC
AGCCTGCAAA ACGACCGTCT GGCACAGCAG GCAGAGGCAC TGGAAAACCA GGGCAAATGG
GCGCAGGCGG CAGCACTTCA GCGGCAACGA CTGGCGCTGG ACCCCGGCAG CGTATGGATT
ACTTACCGAC TTTCGCAGGA TCTCTGGCAG GCCGGACAAC GCAGCCAGGC CGATACGTTA
ATGCGCAATC TGGCGCAGCA GAAGCCGAAC GACCCGGAGC AGGTTTACGC TTACGGGCTG
TACCTCTCTG GTCATGACCA CGACAGAGCG GCGCTGGCGC ATATCAACAG CCTGCCGCGC
GCGCAGTGGA ACAGCAATAT TCAGGAGCTG GTTAATCGAC TGCAAAGCGA TCAGGTGCTG
GAAACCGCTA ATCGCCTGCG AGAAAGCGGC AAAGAAGCTG AAGCGGAAGC GATGCTGCGT
CAGCAACCAC CTTCCACGCG TATTGACCTC ACGCTGGCTG ACTGGGCGCA GCAGCGACGT
GATTACACCG CCGCCCGCGC TGCATATCAG AATGTCCTGA CGCGGGAGCC AACTAACGCC
GATGCCATTC TGGGGCTGAC GGAAGTGGAT ATTGCTGCCG GTGACACAGC GGCGGCACGT
AGCCAGCTGG CGAAACTGCC CGCTACCGAT AACGCCTCGC TGAACACACA GCGGCGCGTG
GCGCTGGCGC AGGCGCAGCT TGGAGATACC GCAGCGGCGC AGCAGACGTT TAATAAGTTG
ATCCCGCAGG CAAAATCTCA GCCACCGTCG ATGGAAAGCG CGATGGTGCT GCGCGACGGT
GCGAAGTTTG AAGCGCAGGC GGGCGATCCA AAGCAGGCGC TGGAAACCTA CAAAGACGCC
ATGGTCGCAT CCGGTGTGAC TACGACGCGT CCGCAGGATA ACGACACATT CACCCGACTG
ACCCGTAACG ACGAGAAAGA TGACTGGCTG AAACGCGGCG TGCGCAGCGA TGCGGCGGAC
CTCTATCGCC AGCAGGATCT TAACGTCACC CTTGAGCATG ATTATTGGGG ATCGAGCGGC
ACCGGTGGTT ACTCCGATCT GAAAGCGCAC ACCACTATGT TGCAGGTGGA CGCGCCGTAT
TCTGACGGGC GGATGTTCTT TCGCAGTGAT TTCGTCAATA TGAACGTCGG CAGTTTCTCC
ACTAATGCCG ATGGTAAATG GGATGATAAC TGGGGCACCT GTACATTACA GGACTGTAGC
GGCAACCGCA GCCAGTCGGA TTCCGGTGCC AGCGTGGCGG TCGGCTGGCG AAATGACGTC
TGGAGCTGGG ATATCGGCAC CACGCCGATG GGCTTCAACG TGGTGGATGT GGTCGGCGGC
ATCAGTTACA GCGATGATAT CGGGCCGCTG GGTTACACCG TTAACGCCCA CCGTCGGCCC
ATCTCCAGTT CTTTGCTGGC CTTTGGTGGG CAAAAAGACT CCCCGAGCAA TACCGGGAAA
AAATGGGGCG GTGTGCGTGC CGACGGCGTG GGGCTAAGCC TGAGCTACGA TAAAGGTGAA
GCAAACGGCG TCTGGGCATC GCTTAGTGGC GACCAGTTAA CCGGTAAAAA TGTCGAAGAT
AACTGGCGCG TGCGCTGGAT GACGGGCTAT TACTATAAGG TCATCAACCA GAACAATCGC
CGCGTCACAA TCGGCCTGAA CAACATGATC TGGCATTACG ACAAAGACCT GAGTGGCTAC
TCACTCGGTC AGGGCGGTTA CTACAGCCCG CAGGAATACC TGTCGTTTGC CATACCGGTG
ATGTGGCGGG AGCGCACGGA AAACTGGTCG TGGGAGCTGG GGGCGTCTGG CTCGTGGTCG
CATTCACGCA CCAAAACCAT GCCGCGTTAT CCGCTGATGA ATCTGATCCC GACCGACTGG
CAGGAAGAAG CTGCGCGGCA ATCCAACGAT GGCGGCAGCA GTCAGGGCTT CGGCTACACG
GCGCGGGCAT TACTTGAACG ACGTGTTACT TCCAACTGGT TTGTTGGCAC GGCAATTGAT
ATCCAGCAGG CGAAAGATTA CGCACCCAGC CATTTCCTGC TCTACGTACG TTATTCCGCC
GCCGGATGGC AGGGTGACAT GGATTTACCG CCGCAGCCGC TGATACCTTA CGCCGACTGG
TAA
 
Protein sequence
MPMVEAAPTA QQQLLEQVRL GEATHREDLV QQSLYRLELI DPNNPDVVAA RFRSLLRQGD 
IDGAQKQLDR LSQLAPSSNA YKSSRTTMLL STPDGRQALQ QARLQATTGH AEEAVASYNK
LFNGAPPEGD IAVEYWSTVA KIPARRGEAI NQLKRINADA PGNTGLQNNL ALLLFSSDRR
DEGFAVLEQM AKSNAGREGA SKIWYGQIKD MPVSDASVSA LKKYLSIFSD GDSVAAAQSQ
LAEQQKQLAD PAFRARAQGL AAVDSGMAGK AIPELQQAVR ANPKDSEALG ALGQAYSQKG
DRANAVANLE KALALDPHSS NNDKWNSLLK VNRYWLAIQQ GDAALKANNP DRAERLFQQA
RNVDNTDSYA VLGLGDVAMA RKDYPAAERY YQQTLRMDSG NTNAVRGLAN IYRQQSPEKA
EAFIASLSAS QRRSIDDIER SLQNDRLAQQ AEALENQGKW AQAAALQRQR LALDPGSVWI
TYRLSQDLWQ AGQRSQADTL MRNLAQQKPN DPEQVYAYGL YLSGHDHDRA ALAHINSLPR
AQWNSNIQEL VNRLQSDQVL ETANRLRESG KEAEAEAMLR QQPPSTRIDL TLADWAQQRR
DYTAARAAYQ NVLTREPTNA DAILGLTEVD IAAGDTAAAR SQLAKLPATD NASLNTQRRV
ALAQAQLGDT AAAQQTFNKL IPQAKSQPPS MESAMVLRDG AKFEAQAGDP KQALETYKDA
MVASGVTTTR PQDNDTFTRL TRNDEKDDWL KRGVRSDAAD LYRQQDLNVT LEHDYWGSSG
TGGYSDLKAH TTMLQVDAPY SDGRMFFRSD FVNMNVGSFS TNADGKWDDN WGTCTLQDCS
GNRSQSDSGA SVAVGWRNDV WSWDIGTTPM GFNVVDVVGG ISYSDDIGPL GYTVNAHRRP
ISSSLLAFGG QKDSPSNTGK KWGGVRADGV GLSLSYDKGE ANGVWASLSG DQLTGKNVED
NWRVRWMTGY YYKVINQNNR RVTIGLNNMI WHYDKDLSGY SLGQGGYYSP QEYLSFAIPV
MWRERTENWS WELGASGSWS HSRTKTMPRY PLMNLIPTDW QEEAARQSND GGSSQGFGYT
ARALLERRVT SNWFVGTAID IQQAKDYAPS HFLLYVRYSA AGWQGDMDLP PQPLIPYADW