Gene EcolC_1787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1787 
Symbol 
ID6066560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1987075 
End bp1989135 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content50% 
IMG OID641601202 
Productprotease 2 
Protein accessionYP_001724764 
Protein GI170019810 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.13117 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTACCAA AAGCCGCCCG CATTCCCCAC GCCATGACGC TTCATGGCGA TACGCGCATC 
GATAATTACT ACTGGCTGCG GGACGATACG CGTTCTCAGC CGGAAGTCCT GGACTACCTG
CAACAAGAAA ATAGTTACGG TCATCGGGTG ATGGCCTCAC AACAAGCCTT GCAGGATCGC
ATCTTAAAGG AAATCATCGA CCGCATTCCG CAAAGAGAAG TTTCTGCGCC CTACATCAAA
AATGGCTACC GCTATCGGCA TATTTATGAA CCAGGCTGTG AATATGCTAT CTACCAGCGT
CAATCGGCGT TCAGTGAAGA GTGGGACGAG TGGGAAATAT TGCTCGATGC CAACAAGCGC
GCGGCTCATA GTGAGTTTTA TTCGATGGGC GGAATGGCGA TTACGCCCGA TAACACCATT
ATGGCGCTGG CAGAAGATTT TCTTTCCCGA CGCCAGTACG GCATTCGTTT TCGTAATCTG
GAAACAGGTA ACTGGTACCC GGAACTGCTG GATAACGTTG AACCCAGCTT TGTCTGGGCA
AATGACTCCT GGACTTTCTA CTATGTTCGC AAGCATCCAG TGACGCTGCT GCCTTATCAG
GTCTGGCGTC ACGCTATCGG TACTCCAGCA TCGCAAGATA AACTGATCTA CGAAGAAAAA
GACGATACCT ATTACGTCAG CCTGCATAAA ACGACGTCGA AGCACTATGT AGTCATTCAT
TTGGCCAGCG CCACCACCAG TGAAGTTCGC CTGCTGGACG CGGAAATGGC CGATGCCGAG
CCGTTTGTTT TTCTGCCGCG CCGCAAAGAT CACGAATACA GCCTTGATCA CTACCAGCAT
CGGTTTTATC TGCGTTCCAA CCGCAACGGC AAAAACTTTG GCTTATACCG TACCCGTATG
CGTGATGAGC AACAGTGGGA AGAGTTAATT CCGCCACGCG AAAACATCAT GCTGGAAGGG
TTTACGCTGT TTACCGACTG GCTGGTGGTT GAAGAGCGTC AGCGCGGGTT AACCAGTTTG
CGCCAAATTA ACCGCAAGAC CCGGGAAGTC ATTGGTATTG CCTTTGATGA TCCGGCCTAT
GTGACCTGGA TTGCCTACAA TCCAGAATCT GAAACCGCGC GATTGCGTTA TGGTTATTCT
TCCATGACCA CACCAGACAC TTTGTTTGAA CTGGATATGG ATACCGGTGA GCGTCGTGTA
TTAAAACAAA CGGAAGTTCC TGGTTTTGAT GCGGCGAATT ACCGCAGTGA ACACCTGTGG
ATAGTCGCCC GTGATGGCGT CGAAGTTCCG GTTTCGCTGG TCTATCATCG CAAACATTTT
CGCAAAGGAC ACAACCCGCT GCTGGTGTAT GGCTATGGTT CTTACGGCGC AAGTATTGAT
GCCGATTTCA GTTTTAGCCG CTTGAGTTTG TTAGATCGTG GCTTTGTCTA CGCCATTGTC
CATGTTCGCG GCGGTGGTGA GCTGGGGCAA CAATGGTACG AAGACGGAAA ATTTCTGAAG
AAGAAAAATA CGTTTAATGA TTATCTTGAT GCCTGCGATG CATTGTTAAA ACTGGGCTAT
GGCTCTCCTT CGCTTTGTTA TGCGATGGGC GGGAGTGCGG GGGGCATGTT GATGGGCGTT
GCGATTAATG CACGCCCTGA ATTATTCCAC GGCGTTATCG CCCAGGTACC GTTTGTTGAT
GTTGTAACAA CAATGCTTGA TGAATCAATT CCTCTTACCA CTGGTGAGTT TGAAGAGTGG
GGGAATCCGC AGGATCCGCA ATATTACGAG TATATGAAAA GCTACAGCCC ATATGACAAC
GTCACCGCAC AGGCTTATCC GCATTTACTG GTAACGACCG GTTTGCACGA TTCTCAGGTG
CAATATTGGG AACCGGCAAA ATGGGTCGCT AAATTGCGCG AGCTGAAAAC CGATGACCAT
CTTTTATTGC TCTGTACCGA CATGGACTCA GGCCATGGCG GTAAATCTGG TCGCTTTAAA
TCGTACGAAG GCGTAGCGAT GGAATATGCT TTTCTGGTCG CGCTGGCGCA GGGAACATTA
CCCGCTACGC CTGCGGATTA A
 
Protein sequence
MLPKAARIPH AMTLHGDTRI DNYYWLRDDT RSQPEVLDYL QQENSYGHRV MASQQALQDR 
ILKEIIDRIP QREVSAPYIK NGYRYRHIYE PGCEYAIYQR QSAFSEEWDE WEILLDANKR
AAHSEFYSMG GMAITPDNTI MALAEDFLSR RQYGIRFRNL ETGNWYPELL DNVEPSFVWA
NDSWTFYYVR KHPVTLLPYQ VWRHAIGTPA SQDKLIYEEK DDTYYVSLHK TTSKHYVVIH
LASATTSEVR LLDAEMADAE PFVFLPRRKD HEYSLDHYQH RFYLRSNRNG KNFGLYRTRM
RDEQQWEELI PPRENIMLEG FTLFTDWLVV EERQRGLTSL RQINRKTREV IGIAFDDPAY
VTWIAYNPES ETARLRYGYS SMTTPDTLFE LDMDTGERRV LKQTEVPGFD AANYRSEHLW
IVARDGVEVP VSLVYHRKHF RKGHNPLLVY GYGSYGASID ADFSFSRLSL LDRGFVYAIV
HVRGGGELGQ QWYEDGKFLK KKNTFNDYLD ACDALLKLGY GSPSLCYAMG GSAGGMLMGV
AINARPELFH GVIAQVPFVD VVTTMLDESI PLTTGEFEEW GNPQDPQYYE YMKSYSPYDN
VTAQAYPHLL VTTGLHDSQV QYWEPAKWVA KLRELKTDDH LLLLCTDMDS GHGGKSGRFK
SYEGVAMEYA FLVALAQGTL PATPAD