Gene Information |
Locus tag | EcolC_3851 |
Symbol | |
ID | 6067446 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4207706 |
End bp | 4211029 |
Gene Length | 3324 bp |
Protein Length | 1107 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641603266 |
Product | hypothetical protein |
Protein accession | YP_001726782 |
Protein GI | 170021828 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3264] Small-conductance mechanosensitive channel |
TIGRFAM ID | |
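
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4207706
END_BP = 4211029
GENE_LENGTH_BP = 3324
PROTEIN_LENGTH_AA = 1107


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print("gene length matches record:", length == GENE_LENGTH_BP)
    print("protein length matches record:",
          protein_length(length) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 4211029 - 4207706 + 1 = 3324 bp, and 3324 / 3 - 1 = 1107 aa, which agrees with the Gene Length and Protein Length fields.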
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00332968 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | GTGCGCCTGA TTATCACTTT TCTGATGGCC TGGTGCCTCA GTTGGGGGGC GTACGCCGCG ACGGCCCCCG ATAGCAAACA AATCACTCAG GAACTGGAGC AGGCAAAAGC GGCGAAACCC GCACAGCCGG AAGTCGTAGA GGCGCTCCAG TCTGCCTTAA ATGCGCTTGA GGAACGAAAA GGTTCCCTTG AGCGCATCAA ACAATATCAG CAAGTTATCG ATAATTATCC GAAACTCTCC GCTACTCTGC GCGCACAATT AAACAACATG CGTGACGAGC CGCGCAGCGT GTCGCCGGGA ATGTCTACCG ACGCGCTGAA TCAGGAAATT CTCCAGGTCA GCAGCCAGTT GCTGGATAAA AGCCGTCAGG CCCAGCAAGA GCAGGAGCGC GCCCGCGAGA TTGCCGATTC GCTGAATCAA CTGCCGCAAC AGCAAACCGA CGCCCGCCGT CAGTTAAATG AGATCGAGCG CCGCCTGGGA ACGCTTACCG GCAATACTCC GCTCAATCAG GCACAAAATT TCGCGTTGCA GTCTGACTCT GCACGTCTTA AGGCGCTCGT TGATGAACTG GAGCTGGCGC AGCTGTCTGC CAATAACCGC CAGGAATTAG CGCGCTTACG CTCAGAGCTG GCGGAAAAAG AGAGCCAGCA ACTGGATGCG TATTTGCAGG CCTTGCGTAA TCAATTAAAC AGCCAACGTC AGCTTGAGGC GGAGCGGGCG CTGGAAAGTA CCGAATTGCT GGCAGAAAAC AGCGCCGATT TGCCGAAAGA TATCGTCGCG CAATTCAAAA TTAACCGCGA ACTATCGGCG GCTTTGAATC AACAGGCGCA GCGGATGGAT CTCATTGCCT CGCAACAGCG TCAGGCTGCC AGCCAGACGT TACAGGTCCG GCAGGCGTTG AATACGCTGC GTGAACAGTC GCAATGGCTG GGATCGTCCA ATCTGCTCGG CGAAGCGCTG CGGGCGCAGG TGGCACGGCT GCCGGAAATG CCGAAACCAC AACAGCTTGA TACCGAAATG GCGCAGTTGC GTGTGCAACG GTTACGTTAT GAGGATCTGC TTAATAAACA GCCGCTGCTA CGGCAAATTC ATCAGGCCGA CGGTCAGCCG CTGACTGCCG AGCAAAACCG TATTCTGGAA GCACAACTGC GCACTCAGCG TGAGTTGCTG AACTCATTGT TGCAGGGTGG CGACACGCTA CTATTGGAAC TGACCAAGCT GAAAGTCTCC AACGGGCAAC TGGAGGATGC GCTGAAAGAG GTGAACGAAG CAACGCACCG CTATCTGTTC TGGACCTCTG ACGTGCGCCC GATGACCATC GCCTGGCCGC TGGAAATCGC CCAGGATCTG CGTCGTCTCA TTTCGCTGGA CACCTTCAGT CAGTTGGGCA AAGCCAGTGT GATGATGCTG ACCAGCAAAG AGACAATTTT GCCGCTGTTT GGCGCGTTGA TTCTGGTCGG TTGCAGTATT TACTCGCGCC GCTATTTCAC CCGTTTTCTT GAACGTTCGG CGGCGAAAGT TGGCAAAGTG ACTCAGGATC ACTTCTGGCT GACGTTGCGC ACTCTTTTCT GGTCGATTCT CGTCGCGTCA CCGTTACCGG TGCTGTGGAT GACGCTGGGT TACGGCTTGC GCGAGGCGTG GCCTTATCCG CTGGCGGTCG CGATTGGCGA TGGTGTAACG GCCACCGTGC CGCTGCTGTG GGTAGTGATG ATTTGCGCCA CCTTTGCCCG CCCGAACGGC TTGTTTATCG CTCATTTTGG CTGGCCGCGC GAACGTGTTT CCCGTGGGAT GCGCTACTAC 
CTGATGAGCA TCGGGCTTAT TGTGCCGCTG ATTATGGCGC TGATGATGTT CGATAACCTC GACGACCGTG AATTCTCCGG TTCGCTGGGA CGGCTTTGCT TTATCCTCAT TTGCGGTGCG CTGGCGGTGG TCACCCTCAG CCTGAAAAAG GCCGGGATTC CGCTGTATCT CAACAAAGAG GGCAGCGGCG ACAACATTAC CAACCATATG CTGTGGAACA TGATGATTGG CGCGCCGTTG GTTGCCATTC TGGCGTCGGC GGTGGGTTAT CTGGCAACGG CACAGGCGCT GTTAGCGAGG CTTGAAACCT CGGTTGCCAT CTGGTTCCTG CTACTGGTGG TTTATCACGT TATCCGCCGC TGGATGCTGA TCCAGCGCCG CAGGCTGGCG TTTGATCGGG CGAAGCATCG CCGGGCAGAG ATGTTAGCGC AACGTGCGCG TGGCGAAGAG GAAGCGCATC ATCACAGTAG CCCGGAAGGA GCAATTGAAG TCGATGAAAG CGAAGTCGAT CTCGATGCCA TCAGTGCGCA ATCCTTGCGG CTGGTGCGCT CAATTTTGAT GTTGATCGCC CTGCTTTCTG TCATTGTGCT GTGGTCAGAA ATCCATTCCG CTTTCGGCTT CCTCGAAAAT ATTTCGCTGT GGGATGTCAC CTCCACGGTA CAGGGCGTAG AAAGTCTGGA GCCAATTACC CTCGGTGCGG TGCTGATTGC CATTCTGGTG TTTATCATCA CCACGCAGCT GGTGCGCAAC TTGCCCGCGC TGCTGGAACT GGCGATTTTG CAGCACCTGG ATTTAACGCC GGGTACGGGT TACGCCATCA CCACCATCAC CAAATATCTG CTGATGCTGA TTGGCGGGCT GGTCGGCTTC TCAATGATTG GTATTGAGTG GTCGAAATTG CAGTGGCTGG TTGCCGCGCT CGGTGTTGGT CTCGGTTTTG GTTTGCAGGA AATTTTCGCC AACTTTATCT CTGGTCTGAT TATCCTGTTC GAAAAACCGA TTCGCATTGG CGATACGGTG ACAATTCGCG ATCTCACCGG TAGCGTGACG AAAATTAACA CCCGCGCCAC CACCATCAGC GACTGGGACC GTAAAGAGAT AATCGTGCCG AACAAGGCGT TTATTACCGA GCAGTTTATC AACTGGTCGC TCTCTGACTC GGTCACGCGC GTGGTGTTGA CGATACCGGC CCCTGCCGAT GCCAATAGCG AAGAAGTGAC GGAAATCCTG CTCACCGCAG CGCGTCGCTG CTCGCTGGTG ATCGACAACC CGGCACCGGA AGTCTTCCTG GTGGATCTGC AACAGGGGAT TCAGATTTTC GAGCTGCGTA TTTACGCCGC TGAGATGGGT CACCGTATGC CGCTACGCCA TGAGATCCAC CAGCTGATTC TGGCTGGCTT CCATGCCCAC GGTATCGATA TGCCATTCCC GCCCTTCCAG ATGCGTCTGG AAAGCCTCAA CGGTAAACAA ACGGGGAGAA CGCTGACGTC TGCGGGCAAA GGTCGTCAGG CGGGAAGTTT GTAA
Protein sequence | MRLIITFLMA WCLSWGAYAA TAPDSKQITQ ELEQAKAAKP AQPEVVEALQ SALNALEERK GSLERIKQYQ QVIDNYPKLS ATLRAQLNNM RDEPRSVSPG MSTDALNQEI LQVSSQLLDK SRQAQQEQER AREIADSLNQ LPQQQTDARR QLNEIERRLG TLTGNTPLNQ AQNFALQSDS ARLKALVDEL ELAQLSANNR QELARLRSEL AEKESQQLDA YLQALRNQLN SQRQLEAERA LESTELLAEN SADLPKDIVA QFKINRELSA ALNQQAQRMD LIASQQRQAA SQTLQVRQAL NTLREQSQWL GSSNLLGEAL RAQVARLPEM PKPQQLDTEM AQLRVQRLRY EDLLNKQPLL RQIHQADGQP LTAEQNRILE AQLRTQRELL NSLLQGGDTL LLELTKLKVS NGQLEDALKE VNEATHRYLF WTSDVRPMTI AWPLEIAQDL RRLISLDTFS QLGKASVMML TSKETILPLF GALILVGCSI YSRRYFTRFL ERSAAKVGKV TQDHFWLTLR TLFWSILVAS PLPVLWMTLG YGLREAWPYP LAVAIGDGVT ATVPLLWVVM ICATFARPNG LFIAHFGWPR ERVSRGMRYY LMSIGLIVPL IMALMMFDNL DDREFSGSLG RLCFILICGA LAVVTLSLKK AGIPLYLNKE GSGDNITNHM LWNMMIGAPL VAILASAVGY LATAQALLAR LETSVAIWFL LLVVYHVIRR WMLIQRRRLA FDRAKHRRAE MLAQRARGEE EAHHHSSPEG AIEVDESEVD LDAISAQSLR LVRSILMLIA LLSVIVLWSE IHSAFGFLEN ISLWDVTSTV QGVESLEPIT LGAVLIAILV FIITTQLVRN LPALLELAIL QHLDLTPGTG YAITTITKYL LMLIGGLVGF SMIGIEWSKL QWLVAALGVG LGFGLQEIFA NFISGLIILF EKPIRIGDTV TIRDLTGSVT KINTRATTIS DWDRKEIIVP NKAFITEQFI NWSLSDSVTR VVLTIPAPAD ANSEEVTEIL LTAARRCSLV IDNPAPEVFL VDLQQGIQIF ELRIYAAEMG HRMPLRHEIH QLILAGFHAH GIDMPFPPFQ MRLESLNGKQ TGRTLTSAGK GRQAGSL
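
The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to turn such a display back into a contiguous string and to inspect the first and last codons. It is an illustration under stated assumptions, not part of the original record: the helper names are hypothetical, and the codon sets are those listed for NCBI translation table 11, the table named in this record. Only short excerpts from the start and end of the gene sequence are used.

```python
# Start and stop codons listed for NCBI translation table 11
# (bacterial, archaeal and plant plastid code).
START_CODONS_TABLE_11 = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def has_valid_ends(seq: str) -> bool:
    """True if seq begins with a table 11 start codon and ends with a stop codon."""
    return seq[:3] in START_CODONS_TABLE_11 and seq[-3:] in STOP_CODONS_TABLE_11


if __name__ == "__main__":
    # Excerpts copied from the first and last blocks of the gene sequence.
    head = clean_sequence("GTGCGCCTGA TTATCACTTT")
    tail = clean_sequence("CGGGAAGTTT GTAA")
    print("first codon:", head[:3])
    print("last codon:", tail[-3:])
```

Passing the full cleaned gene sequence to `gc_percent` and `has_valid_ends` would allow the GC content field and the reading frame boundaries to be checked in the same way. The GTG start codon is consistent with the protein sequence beginning with M, since an initiating codon is translated as methionine.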