Gene EcolC_0180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0180 
Symbol 
ID6064809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp196625 
End bp198184 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content49% 
IMG OID641599582 
Productputative protease 
Protein accessionYP_001723189 
Protein GI170018235 
COG category 
COG ID 
TIGRFAM ID[TIGR03369] cellulose biosynthesis protein BcsE 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACCCTG TATTCTCTAT CGGTATCTCA TCATTATGGG ATGAGCTGCG ACATATGCCA 
GCAGGCGGCG TCTGGTGGTT TAACGTCGAT CGCCATGAAG ATGCTATCAG TCTGGCGAAT
CAAACAATTG CATCCCAGGC TGAAACCGCA CACGTCGCGG TCATTAGCAT GGACAGCGAT
CCGGCGAAAA TCTTTCAATT AGATGATTCT CAAGGGCCGG AAAAAATAAA ATTATTTTCA
ATGCTAAATC ATGAAAAAGG TCTATACTAT TTGACCCGTG ATTTGCAGTG ATCTATTGAT
CCCCATAATT ACCTTTTTAT TCTTGTTTGC GCAAATAACG CATGGCAAAA CATTCCTGCC
GAGCGGCTTC GCTCATGGTT GGATAAAATG AATAAATGGA GCAGGTTAAA CCATTGTTCG
CTTTTGGTAA TTAATCCCGG AAATAATAAC GATAAACAAT TTTCATTGTT GCTTGAGGAA
TACCGTTCAC TTTTTGGTCT TGCCAGTTTG CGTTTTCAGG GTGACCAACA TTTGCTGGAT
ATTGCCTTCT GGTGCAACGA AAAAGGGGTC AGCGCCCGTC AGCAGCTTAG CGTTCAGCAA
CAAAATGGTA TCTGGACATT AGTTCAAAGC GAAGAGGCGG AGATCCAACC ACGCAGCGAC
GAAAAACGCA TTCTGAGTAA TGTTGCTGTA CTGGAAGGTG CGCCGCCGCT ATCGGAACAC
TGGCAACTGT TCAACAATAA CGAAGTCCTG TTCAATGAAG CCCGTACCGC TCAGGCGGCG
ACGGTGGTCT TTTCTTTACA GCAAAATGCG CAAATCGAGC CACTGGCCCG CAGCATTCAT
ACCCTGCGTC GCCAGCGCGG TAGTGCGATG AAAATCCTCG TGCGGGAAAA TACCGCTAGC
CTGCGCGCCA CCGATGAACG TTTGTTATTG GCCTGCGGTG CAAATATGGT TATTCCGTGG
AATGCGCCAC TCTCCCGTTG TCTGACGATG ATCGAAAGCG TGCAAGGGCA GAAGTTTAGT
CGCTATGTGC CGGAAGATAT CACTACCTTG CTGTCAATGA CCCAGCCGCT CAAACTGCGT
GGTTTCCAGA AGTGGGATGT GTTCTGTAAT GCCGTCAACA ACATGATGAA TAACCCTCTA
TTACCTGCCC ACGGTAAAGG CGTTCTGGTT GCCCTACGTC CGGTACCGGG TATCCGCGTT
GAACAAGCCC TGACGCTGTG TCGCCCTAAC CGTACCGGCG ATATCATGAC CATTGGCGGT
AATCGGCTGG TGCTGTTTCT CTCATTCTGT CGGATTAACG ATCTGGATAC CGCGTTGAAT
CATATTTTCC CATTGCCTAC TGGCGACATT TTCTCAAACC GTATGGTCTG GTTTGAAGAT
GATCAAATCA GTGCCGAGCT GGTGCAGATG CGCTTGCTTG CCCCAGAACA ATGGGGCATG
CCGCTGCCTT TAACGCAAAG TTCTAAACCG GTCATCAATG CCGAGCACGA TGGTCGCCAC
TGGCGACGAA TACCAGAACC CATGCGACTG TTAGATGATG CTGTGGAGCG CTCATCATGA
 
Protein sequence
MDPVFSIGIS SLWDELRHMP AGGVWWFNVD RHEDAISLAN QTIASQAETA HVAVISMDSD 
PAKIFQLDDS QGPEKIKLFS MLNHEKGLYY LTRDLQUSID PHNYLFILVC ANNAWQNIPA
ERLRSWLDKM NKWSRLNHCS LLVINPGNNN DKQFSLLLEE YRSLFGLASL RFQGDQHLLD
IAFWCNEKGV SARQQLSVQQ QNGIWTLVQS EEAEIQPRSD EKRILSNVAV LEGAPPLSEH
WQLFNNNEVL FNEARTAQAA TVVFSLQQNA QIEPLARSIH TLRRQRGSAM KILVRENTAS
LRATDERLLL ACGANMVIPW NAPLSRCLTM IESVQGQKFS RYVPEDITTL LSMTQPLKLR
GFQKWDVFCN AVNNMMNNPL LPAHGKGVLV ALRPVPGIRV EQALTLCRPN RTGDIMTIGG
NRLVLFLSFC RINDLDTALN HIFPLPTGDI FSNRMVWFED DQISAELVQM RLLAPEQWGM
PLPLTQSSKP VINAEHDGRH WRRIPEPMRL LDDAVERSS