Gene EcHS_A4401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4401 
Symbol 
ID5594883 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4410788 
End bp4414111 
Gene Length3324 bp 
Protein Length1107 aa 
Translation table11 
GC content56% 
IMG OID640923499 
Producthypothetical protein 
Protein accessionYP_001460943 
Protein GI157163625 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3264] Small-conductance mechanosensitive channel 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.199255 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGCCTGA TTATCACTTT TCTGATGGCC TGGTGCCTCA GTTGGGGGGC GTACGCCGCG 
ACGGCCCCCG ATAGCAAACA AATCACTCAG GAACTGGAGC AGGCAAAAGC GGCGAAACCC
GCACAGCCGG AAGTCGTAGA GGCGCTCCAG TCTGCCTTAA ATGCGCTTGA GGAACGAAAA
GGTTCCCTTG AGCGCATCAA ACAATATCAG CAAGTTATCG ATAATTATCC GAAACTCTCC
GCTACTCTGC GCGCACAATT AAACAACATG CGTGACGAGC CGCGCAGCGT GTCGCCGGGA
ATGTCTACCG ACGCGCTGAA TCAGGAAATT CTCCAGGTCA GCAGCCAGTT GCTGGATAAA
AGCCGTCAGG CCCAGCAAGA GCAGGAGCGC GCCCGCGAGA TTGCCGATTC GCTGAATCAA
CTGCCGCAAC AGCAAACCGA CGCCCGCCGT CAGTTAAATG AGATCGAGCG CCGCCTGGGA
ACGCTTACTG GCAATACCCC GCTCAATCAG GCACAAAATT TCGCGTTGCA GTCTGACTCC
GCACGCCTGA AAGCGCTCGT TGATGAACTG GAACTGGCGC AGCTGTCTGC CAATAACCGT
CAGGAATTAG CGCGCTTACG CTCAGAGCTG GCGGAAAAAG AGAGCCAGCA ACTGGATGCG
TATTTGCAGG CCTTACGTAA TCAGTTGAAC AGCCAACGTC AGCTTGAGGC GGAGCGGGCG
CTGGAAAGTA CCGAATTGCT GGCAGAAAAC AGCGCCGATT TGCCGAAAGA CATCGTCGCG
CAATTCAAAA TTAACCGCGA ACTATCGGCG GCACTGAATC AACAGGCGCA GCGGATGGAT
CTCGTTGCCT CGCAACAGCG CCAGGCCACC AGCCAGACGT TACAGGTTCG GCAGGCGCTG
AATACGCTGC GTGAACAGTC GCAATGGCTG GGATCGTCCA ACCTGCTCGG CGAAGCGTTG
CGGGCGCAGG TGGCACGGCT GCCGGAAATG CCGAAACCAC AACAGCTTGA TACCGAAATG
GCGCAGTTGC GTGTGCAACG GTTACGTTAT GAGGATCTGC TTAATAAACA GCCGCTGCTA
CGGCAAATTC ATCAGGCCGA CGGTCAGCCG CTGACCGCCG AGCAAAACCG TATTCTGGAA
GCACAGCTAC GCACTCAGCG TGAGTTGCTG AACTCATTGT TGCAGGGTGG CGACACGCTA
CTGCTGGAAC TGACCAAGCT GAAAGTCTCC AACGGGCAAC TTGAGGATGC GCTGAAAGAG
GTGAATGAAG CGACGCATCG CTATCTGTTC TGGACCTCTG ACGTGCGCCC GATGACCATC
GCCTGGCCAC TGGAAATCGC CCAGGATCTG CGTCGTCTCA TTTCGCTGGA CACCTTCAGT
CAGTTGGGCA AAGCCAGTGT GATGATGCTG ACCAGCAAAG AGACGATTTT GCCGCTGTTT
GGCGCGTTGA TTCTGGTCGG TTGCAGTATT TACTCGCGCC GCTATTTCAC CCGTTTTCTT
GAACGTTCGG CGGCGAAAGT CGGCAAAGTG ACTCAGGATC ACTTCTGGCT GACGTTGCGC
ACTCTTTTCT GGTCGATTCT CGTCGCGTCA CCGCTGCCGG TGCTGTGGAT GACGCTGGGT
TACGGCTTGC GCGAGGCGTG GCCTTATCCG CTGGCGGTCG CGATTGGCGA TGGCGTAACG
GCCACCGTGC CGCTGCTGTG GGTAGTGATG ATTTGCGCCA CCTTTGCCCG CCCGAACGGC
TTGTTTATCG CTCATTTTGG CTGGCCGCGC GAACGTGTTT CCCGTGGGAT GCGCTACTAC
CTGATGAGCA TCGGGCTTAT TGTGCCGCTG ATTATGGCGC TGATGATGTT CGATAACCTC
GACGACCGTG AATTCTCCGG TTCGCTGGGA CGGCTTTGCT TTATCCTCAT TTGCGGTGCG
CTGGCGGTGG TCACCCTCAG CCTGAAAAAG GCCGGGATCC CGCTGTACCT CAACAAAGAG
GGCAGCGGCG ACAACATTAC CAACCATATG CTGTGGAACA TGATGATTGG CGCACCACTG
GTTGCCATTC TGGCGTCAGC GGTGGGTTAT CTGGCAACGG CGCAGGCACT GTTAGCGAGG
CTTGAAACCT CGGTTGCCAT CTGGTTCCTG CTACTGGTGG TTTATCACGT TATCCGCCGC
TGGATGCTGA TCCAGCGTCG CAGGCTGGCG TTTGACCGGG CGAAGCATCG CCGGGCAGAG
ATGTTAGCGC AACGCGCGCG TGGCGAAGAG GAAGCGCATC ATCACAGTAG CCCGGAAGGG
GCAATTGAAG TCGATGAAAG CGAAGTCGAT CTCGATGCCA TCAGTGCGCA ATCCTTGCGG
CTGGTGCGCT CAATTTTGAT GTTGATCGCC CTGCTTTCTG TCATTGTGCT GTGGTCAGAA
ATCCATTCCG CTTTCGGCTT CCTCGAAAAT ATTTCGCTGT GGGATGTCAC CTCCACGGTA
CAGGGCGTAG AAAGTCTGGA GCCAATTACC CTCGGTGCGG TGCTGATTGC CATTCTGGTG
TTTATCATCA CCACGCAGCT GGTGCGCAAC CTGCCCGCGC TGCTGGAACT GGCGATTTTG
CAGCACCTGG ATTTAACGCC GGGTACGGGC TACGCCATCA CCACCATCAC CAAATATCTG
CTGATGCTGA TTGGCGGGCT GGTCGGCTTC TCGATGATTG GTATTGAGTG GTCGAAATTG
CAGTGGCTGG TTGCCGCGCT CGGTGTTGGT CTCGGTTTTG GTTTGCAGGA GATTTTCGCC
AACTTTATCT CTGGCCTGAT TATCCTGTTC GAAAAACCGA TTCGCATTGG CGATACGGTA
ACGATCCGCG ATCTTACCGG TAGCGTGACG AAAATTAACA CCCGCGCCAC CACCATCAGC
GACTGGGACC GTAAAGAGAT AATCGTGCCG AACAAGGCGT TTATTACCGA GCAGTTTATC
AACTGGTCGC TCTCTGACTC GGTCACGCGC GTGGTGTTGA CGATACCGGC CCCTGCCGAT
GCCAATAGCG AAGAAGTGAC GGAAATCCTG CTCACCGCAG CGCGTCGCTG CTCGCTGGTG
ATCGACAACC CGGCACCGGA AGTCTTCCTG GTGGATCTGC AACAGGGGAT TCAGATTTTC
GAGCTGCGTA TTTACGCCGC TGAGATGGGT CACCGTATGC CGCTACGCCA TGAGATCCAC
CAGCTGATTC TGGCAGGCTT CCACGCCCAC GGTATCGATA TGCCATTCCC GCCCTTCCAG
ATGCGTCTGG AAAGCCTCAA CGGTAAGCAA ACGGGGAGAA CATTAACTTC TGCGGGCAAA
GGTCGTCAGG CGGGAAGTTT GTAA
 
Protein sequence
MRLIITFLMA WCLSWGAYAA TAPDSKQITQ ELEQAKAAKP AQPEVVEALQ SALNALEERK 
GSLERIKQYQ QVIDNYPKLS ATLRAQLNNM RDEPRSVSPG MSTDALNQEI LQVSSQLLDK
SRQAQQEQER AREIADSLNQ LPQQQTDARR QLNEIERRLG TLTGNTPLNQ AQNFALQSDS
ARLKALVDEL ELAQLSANNR QELARLRSEL AEKESQQLDA YLQALRNQLN SQRQLEAERA
LESTELLAEN SADLPKDIVA QFKINRELSA ALNQQAQRMD LVASQQRQAT SQTLQVRQAL
NTLREQSQWL GSSNLLGEAL RAQVARLPEM PKPQQLDTEM AQLRVQRLRY EDLLNKQPLL
RQIHQADGQP LTAEQNRILE AQLRTQRELL NSLLQGGDTL LLELTKLKVS NGQLEDALKE
VNEATHRYLF WTSDVRPMTI AWPLEIAQDL RRLISLDTFS QLGKASVMML TSKETILPLF
GALILVGCSI YSRRYFTRFL ERSAAKVGKV TQDHFWLTLR TLFWSILVAS PLPVLWMTLG
YGLREAWPYP LAVAIGDGVT ATVPLLWVVM ICATFARPNG LFIAHFGWPR ERVSRGMRYY
LMSIGLIVPL IMALMMFDNL DDREFSGSLG RLCFILICGA LAVVTLSLKK AGIPLYLNKE
GSGDNITNHM LWNMMIGAPL VAILASAVGY LATAQALLAR LETSVAIWFL LLVVYHVIRR
WMLIQRRRLA FDRAKHRRAE MLAQRARGEE EAHHHSSPEG AIEVDESEVD LDAISAQSLR
LVRSILMLIA LLSVIVLWSE IHSAFGFLEN ISLWDVTSTV QGVESLEPIT LGAVLIAILV
FIITTQLVRN LPALLELAIL QHLDLTPGTG YAITTITKYL LMLIGGLVGF SMIGIEWSKL
QWLVAALGVG LGFGLQEIFA NFISGLIILF EKPIRIGDTV TIRDLTGSVT KINTRATTIS
DWDRKEIIVP NKAFITEQFI NWSLSDSVTR VVLTIPAPAD ANSEEVTEIL LTAARRCSLV
IDNPAPEVFL VDLQQGIQIF ELRIYAAEMG HRMPLRHEIH QLILAGFHAH GIDMPFPPFQ
MRLESLNGKQ TGRTLTSAGK GRQAGSL