Gene EcolC_0185 details

Gene Information

Locus tag: EcolC_0185
Symbol:
ID: 6068243
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 202077
End bp: 204386
Gene Length: 2310 bp
Protein Length: 769 aa
Translation table: 11
GC content: 55%
IMG OID: 641599586
Product: cellulose synthase regulator protein
Protein accession: YP_001723193
Protein GI: 170018239
COG category:
COG ID:
TIGRFAM ID:
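
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, which the record does not state explicitly but which matches the listed 2310 bp. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 202077
end_bp = 204386

# Assumption: coordinates are 1-based and inclusive, so both end
# positions count toward the length.
gene_length = end_bp - start_bp + 1

# The final codon is the stop codon, which is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 2310 769
```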


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTATGG GGATGAGTGC GTTCCCCTCT TTCATGACGC AGGCGACGCC AGCAACGCAA 
CCACTGATCA ATGCTGAGCC AGCTGTAGCC GCCCAGACGG AACAAAATCC GCAGGTGGGG
CAAGTGATGC CGGGCGTGCA GGGCGCTGAT GCGCCAGTCG TGGCGCAGAA CGGTCCTTCG
CGTGATGTGA AGCTGACCTT TGCGCAAATT GCACCGCCGC CGGGCAGCAT GGTGCTACGT
GGCATTAACC CGAACGGCAG CATTGAGTTT GGTATGCGCA GCGATGAAGT GGTGACGAAG
GCGATGCTCA ACCTCGAATA CACCCCATCG CCATCGTTAC TGCCTGTCCA GTCGCAGTTA
AAGGTTTATC TCAATGATGA ACTGATGGGC GTGCTGCCAG TGACCAAAGA ACAGTTGGGT
AAAAAAACGC TGGCGCAAAT GCCCATTAAC CCACTGTTTA TTACCGACTT CAACCGTGTA
CGGCTGGAGT TTGTCGGCCA TTATCAGGAC GTGTGCGAAA ACCCGGCCAG CACCACGCTT
TGGCTGGATG TTGGGCGGAG CAGTGGACTG GATCTGACCT ATCAGACCCT GAATGTGAAG
AATGACCTGT CACACTTCCC GGTGCCATTC TTTGACCCGC GCGATAACCG TACTAACACC
TTGCCGATGG TCTTTGCGGG TGCGCCGGAT GTTGGGCTGC AACAAGCCTC TGCCATTGTC
GCCTCGTGGT TTGGTTCGCG TTCTGGCTGG CGTGGGCAGA ACTTCCCGGT GCTCTATAAC
CAACTGCCGG ATCGTAATGC CATTGTCTTT GCAACCAACG ACAAACGGCC GGACTTCCTG
CGCGATCATC CGGCGGTAAA AGCCCCGGTG ATTGAGATGA TTAACCATCC GCAGAATCCT
TACGTCAAAC TGCTGGTGGT GTTTGGTCGT GACGACAAAG ACCTGTTGCA GGCAGCGAAA
GGTATCGCTC AGGGTAACAT TCTGTTCCGT GGTGAAAGCG TGGTAGTGAA TGAAGTGAAA
CCGCTGCTAC CGCGTAAGCC TTACGATGCG CCGAACTGGG TACGTACCGA TCGTCCGGTC
ACCTTTGGTG AACTGAAAAC CTATGAAGAA CAGTTACAAT CCAGCGGTCT TGAGCCAGCA
GCGATTAACG TTTCGCTAAA CCTGCCACCG GATCTCTACC TGATGCGCAG TACCGGCATT
GATATGGATA TCAACTATCG CTACACCATG CCGCCGGTGA AAGACAGTTC GCGGATGGAT
ATCAGCCTGA ATAACCAGTT CCTGCAATCC TTCAACCTGA GCAGCAAACA GGAGGCGAAC
CGCCTGCTGC TGCGGATTCC GGTATTACAA GGTTTGCTGG ATGGCAAAAC AGATGTCTCT
ATTCCGGCGC TGAAACTGGG CGCGACCAAC CAGCTGCGCT TCGACTTTGA GTATATGAAC
CCGATGCCAG GCGGTTCGGT AGATAACTGT ATTACCTTCC AGCCGGTGCA GAATCATGTG
GTGATTGGTG ACGATTCCAC CATCGACTTC TCGAAGTATT ACCACTTCAT CCCGATGCCG
GATCTACGCG CCTTTGCTAA CGCGGGCTTC CCATTCAGCC GGATGGCGGA TCTGTCGCAA
ACCATCACCG TGATGCCGAA AGCGCCTAAC GAAGCACAGA TGGAAACGTT GCTGAATACT
GTTGGTTTTA TCGGCGCACA GACGGGCTTC CCGGCGATTA ATCTGACGGT GACCGATGAT
GGCAGCACCA TTCAGGGCAA AGATGCCGAC ATCATGATCA TCGGTGGTAT CCCGGACAAA
CTGAAAGACG ATAAGCAGAT CGACCTATTG GTGCAGGCGA CCGAAAGCTG GGTGAAAACA
CCGATGCGCC AGACCCCGTT CCCCGGCATT GTACCGGACG AGAGCGATCG CGCGGCAGAA
ACCCGGTCAA CGCTGACCTC TTCCGGTGCG ATGGCGGCGG TGATTGGCTT CCAGTCGCCG
TATAACGACC AGCGCAGCGT GATTGCGCTG CTGGCAGATA GCCCACGCGG TTATGAAATG
CTTAACGATG CGGTGAACGA TAGCGGCAAA CGCGCCACCA TGTTCGGTTC GGTCGCGGTG
ATCCGCGAGT CCGGTATCAA CAGCCTACGT GTTGGCGACG TTTATTACGT AGGTCATCTG
CCGTGGTTCG AGCGCTTGTG GTATGCGCTG GCAAACCATC CGATTCTGCT GGCGGTGCTG
GCGGCAATCA GTGTGATATT GCTGGCATGG GTACTGTGGC GTCTGCTGCG AATTATTAGT
CGTCGTCGTC TTAACCCGGA TAACGAGTAA
 
Protein sequence
MAMGMSAFPS FMTQATPATQ PLINAEPAVA AQTEQNPQVG QVMPGVQGAD APVVAQNGPS 
RDVKLTFAQI APPPGSMVLR GINPNGSIEF GMRSDEVVTK AMLNLEYTPS PSLLPVQSQL
KVYLNDELMG VLPVTKEQLG KKTLAQMPIN PLFITDFNRV RLEFVGHYQD VCENPASTTL
WLDVGRSSGL DLTYQTLNVK NDLSHFPVPF FDPRDNRTNT LPMVFAGAPD VGLQQASAIV
ASWFGSRSGW RGQNFPVLYN QLPDRNAIVF ATNDKRPDFL RDHPAVKAPV IEMINHPQNP
YVKLLVVFGR DDKDLLQAAK GIAQGNILFR GESVVVNEVK PLLPRKPYDA PNWVRTDRPV
TFGELKTYEE QLQSSGLEPA AINVSLNLPP DLYLMRSTGI DMDINYRYTM PPVKDSSRMD
ISLNNQFLQS FNLSSKQEAN RLLLRIPVLQ GLLDGKTDVS IPALKLGATN QLRFDFEYMN
PMPGGSVDNC ITFQPVQNHV VIGDDSTIDF SKYYHFIPMP DLRAFANAGF PFSRMADLSQ
TITVMPKAPN EAQMETLLNT VGFIGAQTGF PAINLTVTDD GSTIQGKDAD IMIIGGIPDK
LKDDKQIDLL VQATESWVKT PMRQTPFPGI VPDESDRAAE TRSTLTSSGA MAAVIGFQSP
YNDQRSVIAL LADSPRGYEM LNDAVNDSGK RATMFGSVAV IRESGINSLR VGDVYYVGHL
PWFERLWYAL ANHPILLAVL AAISVILLAW VLWRLLRIIS RRRLNPDNE
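
The gene sequence begins with GTG, yet the protein begins with M. Under translation table 11 (the bacterial code), GTG can act as a start codon, and an initiating codon is read as methionine. The Python sketch below illustrates this on the first 30 bases of the gene sequence. The function name `translate_cds` and the `START_CODONS` set are hypothetical, and the set lists only a subset of the start codons that table 11 allows.

```python
# Codon table in NCBI base order T, C, A, G. The amino-acid assignments
# are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a simplified subset of the table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            # An initiating codon is read as methionine.
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGGCTATGG GGATGAGTGC GTTCCCCTCT"))  # prints: MAMGMSAFPS
```

The output matches the first ten residues of the protein sequence listed above.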