Gene Information |
Locus tag | EcHS_A4401 |
Symbol | |
ID | 5594883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4410788 |
End bp | 4414111 |
Gene Length | 3324 bp |
Protein Length | 1107 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640923499 |
Product | hypothetical protein |
Protein accession | YP_001460943 |
Protein GI | 157163625 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3264] Small-conductance mechanosensitive channel |
TIGRFAM ID | |
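
The coordinates and lengths listed above are mutually consistent, and a short check can confirm this. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length includes the stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4410788, 4414111)
print(length_bp)                  # 3324
print(protein_length(length_bp))  # 1107
```

Both values match the Gene Length and Protein Length rows of the record.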
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.199255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGCGCCTGA TTATCACTTT TCTGATGGCC TGGTGCCTCA GTTGGGGGGC GTACGCCGCG ACGGCCCCCG ATAGCAAACA AATCACTCAG GAACTGGAGC AGGCAAAAGC GGCGAAACCC GCACAGCCGG AAGTCGTAGA GGCGCTCCAG TCTGCCTTAA ATGCGCTTGA GGAACGAAAA GGTTCCCTTG AGCGCATCAA ACAATATCAG CAAGTTATCG ATAATTATCC GAAACTCTCC GCTACTCTGC GCGCACAATT AAACAACATG CGTGACGAGC CGCGCAGCGT GTCGCCGGGA ATGTCTACCG ACGCGCTGAA TCAGGAAATT CTCCAGGTCA GCAGCCAGTT GCTGGATAAA AGCCGTCAGG CCCAGCAAGA GCAGGAGCGC GCCCGCGAGA TTGCCGATTC GCTGAATCAA CTGCCGCAAC AGCAAACCGA CGCCCGCCGT CAGTTAAATG AGATCGAGCG CCGCCTGGGA ACGCTTACTG GCAATACCCC GCTCAATCAG GCACAAAATT TCGCGTTGCA GTCTGACTCC GCACGCCTGA AAGCGCTCGT TGATGAACTG GAACTGGCGC AGCTGTCTGC CAATAACCGT CAGGAATTAG CGCGCTTACG CTCAGAGCTG GCGGAAAAAG AGAGCCAGCA ACTGGATGCG TATTTGCAGG CCTTACGTAA TCAGTTGAAC AGCCAACGTC AGCTTGAGGC GGAGCGGGCG CTGGAAAGTA CCGAATTGCT GGCAGAAAAC AGCGCCGATT TGCCGAAAGA CATCGTCGCG CAATTCAAAA TTAACCGCGA ACTATCGGCG GCACTGAATC AACAGGCGCA GCGGATGGAT CTCGTTGCCT CGCAACAGCG CCAGGCCACC AGCCAGACGT TACAGGTTCG GCAGGCGCTG AATACGCTGC GTGAACAGTC GCAATGGCTG GGATCGTCCA ACCTGCTCGG CGAAGCGTTG CGGGCGCAGG TGGCACGGCT GCCGGAAATG CCGAAACCAC AACAGCTTGA TACCGAAATG GCGCAGTTGC GTGTGCAACG GTTACGTTAT GAGGATCTGC TTAATAAACA GCCGCTGCTA CGGCAAATTC ATCAGGCCGA CGGTCAGCCG CTGACCGCCG AGCAAAACCG TATTCTGGAA GCACAGCTAC GCACTCAGCG TGAGTTGCTG AACTCATTGT TGCAGGGTGG CGACACGCTA CTGCTGGAAC TGACCAAGCT GAAAGTCTCC AACGGGCAAC TTGAGGATGC GCTGAAAGAG GTGAATGAAG CGACGCATCG CTATCTGTTC TGGACCTCTG ACGTGCGCCC GATGACCATC GCCTGGCCAC TGGAAATCGC CCAGGATCTG CGTCGTCTCA TTTCGCTGGA CACCTTCAGT CAGTTGGGCA AAGCCAGTGT GATGATGCTG ACCAGCAAAG AGACGATTTT GCCGCTGTTT GGCGCGTTGA TTCTGGTCGG TTGCAGTATT TACTCGCGCC GCTATTTCAC CCGTTTTCTT GAACGTTCGG CGGCGAAAGT CGGCAAAGTG ACTCAGGATC ACTTCTGGCT GACGTTGCGC ACTCTTTTCT GGTCGATTCT CGTCGCGTCA CCGCTGCCGG TGCTGTGGAT GACGCTGGGT TACGGCTTGC GCGAGGCGTG GCCTTATCCG CTGGCGGTCG CGATTGGCGA TGGCGTAACG GCCACCGTGC CGCTGCTGTG GGTAGTGATG ATTTGCGCCA CCTTTGCCCG CCCGAACGGC TTGTTTATCG CTCATTTTGG CTGGCCGCGC GAACGTGTTT CCCGTGGGAT GCGCTACTAC 
CTGATGAGCA TCGGGCTTAT TGTGCCGCTG ATTATGGCGC TGATGATGTT CGATAACCTC GACGACCGTG AATTCTCCGG TTCGCTGGGA CGGCTTTGCT TTATCCTCAT TTGCGGTGCG CTGGCGGTGG TCACCCTCAG CCTGAAAAAG GCCGGGATCC CGCTGTACCT CAACAAAGAG GGCAGCGGCG ACAACATTAC CAACCATATG CTGTGGAACA TGATGATTGG CGCACCACTG GTTGCCATTC TGGCGTCAGC GGTGGGTTAT CTGGCAACGG CGCAGGCACT GTTAGCGAGG CTTGAAACCT CGGTTGCCAT CTGGTTCCTG CTACTGGTGG TTTATCACGT TATCCGCCGC TGGATGCTGA TCCAGCGTCG CAGGCTGGCG TTTGACCGGG CGAAGCATCG CCGGGCAGAG ATGTTAGCGC AACGCGCGCG TGGCGAAGAG GAAGCGCATC ATCACAGTAG CCCGGAAGGG GCAATTGAAG TCGATGAAAG CGAAGTCGAT CTCGATGCCA TCAGTGCGCA ATCCTTGCGG CTGGTGCGCT CAATTTTGAT GTTGATCGCC CTGCTTTCTG TCATTGTGCT GTGGTCAGAA ATCCATTCCG CTTTCGGCTT CCTCGAAAAT ATTTCGCTGT GGGATGTCAC CTCCACGGTA CAGGGCGTAG AAAGTCTGGA GCCAATTACC CTCGGTGCGG TGCTGATTGC CATTCTGGTG TTTATCATCA CCACGCAGCT GGTGCGCAAC CTGCCCGCGC TGCTGGAACT GGCGATTTTG CAGCACCTGG ATTTAACGCC GGGTACGGGC TACGCCATCA CCACCATCAC CAAATATCTG CTGATGCTGA TTGGCGGGCT GGTCGGCTTC TCGATGATTG GTATTGAGTG GTCGAAATTG CAGTGGCTGG TTGCCGCGCT CGGTGTTGGT CTCGGTTTTG GTTTGCAGGA GATTTTCGCC AACTTTATCT CTGGCCTGAT TATCCTGTTC GAAAAACCGA TTCGCATTGG CGATACGGTA ACGATCCGCG ATCTTACCGG TAGCGTGACG AAAATTAACA CCCGCGCCAC CACCATCAGC GACTGGGACC GTAAAGAGAT AATCGTGCCG AACAAGGCGT TTATTACCGA GCAGTTTATC AACTGGTCGC TCTCTGACTC GGTCACGCGC GTGGTGTTGA CGATACCGGC CCCTGCCGAT GCCAATAGCG AAGAAGTGAC GGAAATCCTG CTCACCGCAG CGCGTCGCTG CTCGCTGGTG ATCGACAACC CGGCACCGGA AGTCTTCCTG GTGGATCTGC AACAGGGGAT TCAGATTTTC GAGCTGCGTA TTTACGCCGC TGAGATGGGT CACCGTATGC CGCTACGCCA TGAGATCCAC CAGCTGATTC TGGCAGGCTT CCACGCCCAC GGTATCGATA TGCCATTCCC GCCCTTCCAG ATGCGTCTGG AAAGCCTCAA CGGTAAGCAA ACGGGGAGAA CATTAACTTC TGCGGGCAAA GGTCGTCAGG CGGGAAGTTT GTAA
|
Protein sequence | MRLIITFLMA WCLSWGAYAA TAPDSKQITQ ELEQAKAAKP AQPEVVEALQ SALNALEERK GSLERIKQYQ QVIDNYPKLS ATLRAQLNNM RDEPRSVSPG MSTDALNQEI LQVSSQLLDK SRQAQQEQER AREIADSLNQ LPQQQTDARR QLNEIERRLG TLTGNTPLNQ AQNFALQSDS ARLKALVDEL ELAQLSANNR QELARLRSEL AEKESQQLDA YLQALRNQLN SQRQLEAERA LESTELLAEN SADLPKDIVA QFKINRELSA ALNQQAQRMD LVASQQRQAT SQTLQVRQAL NTLREQSQWL GSSNLLGEAL RAQVARLPEM PKPQQLDTEM AQLRVQRLRY EDLLNKQPLL RQIHQADGQP LTAEQNRILE AQLRTQRELL NSLLQGGDTL LLELTKLKVS NGQLEDALKE VNEATHRYLF WTSDVRPMTI AWPLEIAQDL RRLISLDTFS QLGKASVMML TSKETILPLF GALILVGCSI YSRRYFTRFL ERSAAKVGKV TQDHFWLTLR TLFWSILVAS PLPVLWMTLG YGLREAWPYP LAVAIGDGVT ATVPLLWVVM ICATFARPNG LFIAHFGWPR ERVSRGMRYY LMSIGLIVPL IMALMMFDNL DDREFSGSLG RLCFILICGA LAVVTLSLKK AGIPLYLNKE GSGDNITNHM LWNMMIGAPL VAILASAVGY LATAQALLAR LETSVAIWFL LLVVYHVIRR WMLIQRRRLA FDRAKHRRAE MLAQRARGEE EAHHHSSPEG AIEVDESEVD LDAISAQSLR LVRSILMLIA LLSVIVLWSE IHSAFGFLEN ISLWDVTSTV QGVESLEPIT LGAVLIAILV FIITTQLVRN LPALLELAIL QHLDLTPGTG YAITTITKYL LMLIGGLVGF SMIGIEWSKL QWLVAALGVG LGFGLQEIFA NFISGLIILF EKPIRIGDTV TIRDLTGSVT KINTRATTIS DWDRKEIIVP NKAFITEQFI NWSLSDSVTR VVLTIPAPAD ANSEEVTEIL LTAARRCSLV IDNPAPEVFL VDLQQGIQIF ELRIYAAEMG HRMPLRHEIH QLILAGFHAH GIDMPFPPFQ MRLESLNGKQ TGRTLTSAGK GRQAGSL
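
The gene sequence begins with GTG, yet the protein sequence begins with M. This is expected under translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below is a minimal, hand-written translator that illustrates this. The codon table and start-codon set are transcribed from NCBI translation table 11 as an assumption of this example, and `translate` is a hypothetical helper rather than a function from any library.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(cds: str) -> str:
    """Translate a coding sequence, reading an initiation codon as M."""
    cds = cds.replace(" ", "").upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    if not codons:
        return ""
    first = "M" if codons[0] in START_CODONS else CODON_TABLE[codons[0]]
    protein = first + "".join(CODON_TABLE[c] for c in codons[1:])
    return protein.split("*")[0]  # stop at the first stop codon


# First 30 bases of the gene sequence in this record.
print(translate("GTGCGCCTGA TTATCACTTT TCTGATGGCC"))  # MRLIITFLMA
```

The output matches the first ten residues of the protein sequence listed in the record.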