Gene EcE24377A_4020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_4020 
SymbolbcsB 
ID5589747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp4002131 
End bp4004470 
Gene Length2340 bp 
Protein Length779 aa 
Translation table11 
GC content55% 
IMG OID640927641 
Productcellulose synthase regulator protein 
Protein accessionYP_001465002 
Protein GI157156498 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAA AACTATTCTG GATTTGTGCA GTGGCTATGG GGATGAGTGC GTTCCCCTCT 
TTCATGACGC AGGCGACGCC AGCAACGCAA CCACTGATCA ATGCTGAGCC AGCTGTAGCC
GCCCAGACGG AACAAAATCC GCAGGTGGGG CAAGTGATGC CGGGCGTGCA GGGCGCTGAT
GCGCCAGTCG TGGCGCAGAA CGGTCCTTCG CGTGATGTGA AGCTGACCTT TGCGCAAATT
GCGCCGCCGC CGGGCAGCAT GGTGCTACGT GGCATTAACC CGAACGGCAG CATTGAGTTT
GGTATGCGCA GCGATGAAGT GGTGACGAAG GCGATGCTCA ACCTCGAATA CACGCCATCG
CCATCGTTAC TGCCTGTCCA GTCGCAGTTA AAGGTTTATC TCAATGATGA GCTGATGGGC
GTGCTGCCAG TGACCAAAGA ACAGTTGGGT AAAAAAACGC TGGCGCAAAT GCCCATTAAC
CCACTGTTTA TTACCGACTT CAACCGTGTG CGGCTGGAGT TTGTCGGCCA TTATCAGGAC
GTGTGCGAAA ACCCGGCCAG CACCACGCTT TGGCTGGATG TTGGGCGGAG CAGTGGACTG
GATCTGACCT ATCAGACCCT GAATGTGAAG AATGACCTGT CACACTTCCC GGTGCCATTC
TTTGATCCGC GTGATAACCG CACCAACACC TTGCCGATGG TCTTTGCGGG TGCGCCGGAT
GTTGGGCTGC AACAAGCCTC TGCCATTGTC GCCTCGTGGT TTGGTTCGCG TTCTGGCTGG
CGTGGGCAGA ACTTCCCGGT GCTCTATAAC CAACTGCCGG ATCGTAATGC CATTGTCTTT
GCAACCAACG ACAAACGGCC GGACTTCCTG CGCGATCATC CGGCGGTAAA AGCCCCGGTG
ATTGAGATGA TTAACCATCC GCAGAATCCT TACGTCAAAC TGCTGGTGGT GTTTGGTCGT
GACGACAAAG ACCTGTTGCA GGCAGCGAAA GGTATCGCTC AGGGTAACAT TCTGTTCCGT
GGTGAAAGCG TGGTAGTGAA TGAAGTGAAA CCGCTGCTAC CGCGTAAGCC GTACGATGCG
CCGAACTGGG TACGTACCGA TCGTCCGGTC ACATTTGGCG AACTGAAAAC CTATGAAGAA
CAGTTACAAT CCAGCGGTCT TGAGCCAGCA GCGATTAACG TTTCGCTAAA CCTGCCGCCG
GATCTCTACC TGATGCGCAG TACCGGCATT GATATGGATA TTAATTACCG CTACACCATG
CCGCCGGTGA AAGACAGTTC GCGGATGGAT ATCAGCCTGA ATAACCAGTT CCTGCAATCC
TTCAACCTGA GCAGCAAACA GGAGGCGAAC CGCCTGCTGC TGCGGATTCC GGTATTACAA
GGTTTGCTGG ATGGCAAAAC AGATGTCTCT ATTCCGGCGC TGAAACTGGG CGCGACCAAC
CAGCTGCGCT TCGACTTTGA GTATATGAAC CCGATGCCGG GCGGTTCGGT GGATAACTGT
ATTACCTTCC AGCCGGTGCA GAATCATGTG GTGATTGGTG ACGACTCCAC CATCGACTTC
TCGAAGTATT ACCACTTCAT CCCGATGCCG GATCTACGCG CCTTTGCTAA CGCGGGCTTC
CCATTCAGCC GGATGGCGGA TCTGTCGCAA ACCATCACCG TGATGCCGAA AGCGCCTAAC
GAAGCACAGA TGGAAACGTT GCTGAATACT GTTGGTTTTA TCGGCGCACA GACGGGCTTC
CCGGCGATTA ATCTGACGGT GACCGATGAT GGCAGCACCA TTCAGGGCAA AGATGCCGAC
ATCATGATCA TCGGTGGTAT CCCGGACAAA CTGAAAGACG ATAAGCAGAT CGACCTATTG
GTGCAGGCGA CCGAAAGCTG GGTGAAAACA CCGATGCGCC AGACCCCGTT CCCCGGCATT
GTGCCGGACG AGAGCGATCG CGCGGCAGAA ACCCAGTCAA CGCTGATCTC TTCCGGTGCG
ATGGCGGCGG TGATTGGCTT CCAGTCGCCG TATAACGACC AGCGCAGCGT GATTGCGCTG
CTGGCAGATA GCCCACGCGG TTATGAAATG CTTAACGATG CGGTGAACGA TAGCGGCAAA
CGCGCCACCA TGTTCGGTTC GGTCGCGGTG ATCCGCGAGT CCGGTATCAA CAGCCTACGT
GTTGGCGACG TTTATTACGT AGGTCATCTG CCGTGGTTCG AGCGCTTGTG GTATGCGCTG
GCAAACCATC CGATTCTGCT GGCGGTGCTG GCGGCTATCA GTGTGATATT GCTGGCATGG
GTACTGTGGC GTCTGCTGCG AATTATTAGT CGTCGTCGTC TTAACCCGGA TAACGAGTAA
 
Protein sequence
MKRKLFWICA VAMGMSAFPS FMTQATPATQ PLINAEPAVA AQTEQNPQVG QVMPGVQGAD 
APVVAQNGPS RDVKLTFAQI APPPGSMVLR GINPNGSIEF GMRSDEVVTK AMLNLEYTPS
PSLLPVQSQL KVYLNDELMG VLPVTKEQLG KKTLAQMPIN PLFITDFNRV RLEFVGHYQD
VCENPASTTL WLDVGRSSGL DLTYQTLNVK NDLSHFPVPF FDPRDNRTNT LPMVFAGAPD
VGLQQASAIV ASWFGSRSGW RGQNFPVLYN QLPDRNAIVF ATNDKRPDFL RDHPAVKAPV
IEMINHPQNP YVKLLVVFGR DDKDLLQAAK GIAQGNILFR GESVVVNEVK PLLPRKPYDA
PNWVRTDRPV TFGELKTYEE QLQSSGLEPA AINVSLNLPP DLYLMRSTGI DMDINYRYTM
PPVKDSSRMD ISLNNQFLQS FNLSSKQEAN RLLLRIPVLQ GLLDGKTDVS IPALKLGATN
QLRFDFEYMN PMPGGSVDNC ITFQPVQNHV VIGDDSTIDF SKYYHFIPMP DLRAFANAGF
PFSRMADLSQ TITVMPKAPN EAQMETLLNT VGFIGAQTGF PAINLTVTDD GSTIQGKDAD
IMIIGGIPDK LKDDKQIDLL VQATESWVKT PMRQTPFPGI VPDESDRAAE TQSTLISSGA
MAAVIGFQSP YNDQRSVIAL LADSPRGYEM LNDAVNDSGK RATMFGSVAV IRESGINSLR
VGDVYYVGHL PWFERLWYAL ANHPILLAVL AAISVILLAW VLWRLLRIIS RRRLNPDNE