Gene EcolC_3851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3851 
Symbol 
ID6067446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4207706 
End bp4211029 
Gene Length3324 bp 
Protein Length1107 aa 
Translation table11 
GC content56% 
IMG OID641603266 
Producthypothetical protein 
Protein accessionYP_001726782 
Protein GI170021828 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3264] Small-conductance mechanosensitive channel 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00332968 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCGCCTGA TTATCACTTT TCTGATGGCC TGGTGCCTCA GTTGGGGGGC GTACGCCGCG 
ACGGCCCCCG ATAGCAAACA AATCACTCAG GAACTGGAGC AGGCAAAAGC GGCGAAACCC
GCACAGCCGG AAGTCGTAGA GGCGCTCCAG TCTGCCTTAA ATGCGCTTGA GGAACGAAAA
GGTTCCCTTG AGCGCATCAA ACAATATCAG CAAGTTATCG ATAATTATCC GAAACTCTCC
GCTACTCTGC GCGCACAATT AAACAACATG CGTGACGAGC CGCGCAGCGT GTCGCCGGGA
ATGTCTACCG ACGCGCTGAA TCAGGAAATT CTCCAGGTCA GCAGCCAGTT GCTGGATAAA
AGCCGTCAGG CCCAGCAAGA GCAGGAGCGC GCCCGCGAGA TTGCCGATTC GCTGAATCAA
CTGCCGCAAC AGCAAACCGA CGCCCGCCGT CAGTTAAATG AGATCGAGCG CCGCCTGGGA
ACGCTTACCG GCAATACTCC GCTCAATCAG GCACAAAATT TCGCGTTGCA GTCTGACTCT
GCACGTCTTA AGGCGCTCGT TGATGAACTG GAGCTGGCGC AGCTGTCTGC CAATAACCGC
CAGGAATTAG CGCGCTTACG CTCAGAGCTG GCGGAAAAAG AGAGCCAGCA ACTGGATGCG
TATTTGCAGG CCTTGCGTAA TCAATTAAAC AGCCAACGTC AGCTTGAGGC GGAGCGGGCG
CTGGAAAGTA CCGAATTGCT GGCAGAAAAC AGCGCCGATT TGCCGAAAGA TATCGTCGCG
CAATTCAAAA TTAACCGCGA ACTATCGGCG GCTTTGAATC AACAGGCGCA GCGGATGGAT
CTCATTGCCT CGCAACAGCG TCAGGCTGCC AGCCAGACGT TACAGGTCCG GCAGGCGTTG
AATACGCTGC GTGAACAGTC GCAATGGCTG GGATCGTCCA ATCTGCTCGG CGAAGCGCTG
CGGGCGCAGG TGGCACGGCT GCCGGAAATG CCGAAACCAC AACAGCTTGA TACCGAAATG
GCGCAGTTGC GTGTGCAACG GTTACGTTAT GAGGATCTGC TTAATAAACA GCCGCTGCTA
CGGCAAATTC ATCAGGCCGA CGGTCAGCCG CTGACTGCCG AGCAAAACCG TATTCTGGAA
GCACAACTGC GCACTCAGCG TGAGTTGCTG AACTCATTGT TGCAGGGTGG CGACACGCTA
CTATTGGAAC TGACCAAGCT GAAAGTCTCC AACGGGCAAC TGGAGGATGC GCTGAAAGAG
GTGAACGAAG CAACGCACCG CTATCTGTTC TGGACCTCTG ACGTGCGCCC GATGACCATC
GCCTGGCCGC TGGAAATCGC CCAGGATCTG CGTCGTCTCA TTTCGCTGGA CACCTTCAGT
CAGTTGGGCA AAGCCAGTGT GATGATGCTG ACCAGCAAAG AGACAATTTT GCCGCTGTTT
GGCGCGTTGA TTCTGGTCGG TTGCAGTATT TACTCGCGCC GCTATTTCAC CCGTTTTCTT
GAACGTTCGG CGGCGAAAGT TGGCAAAGTG ACTCAGGATC ACTTCTGGCT GACGTTGCGC
ACTCTTTTCT GGTCGATTCT CGTCGCGTCA CCGTTACCGG TGCTGTGGAT GACGCTGGGT
TACGGCTTGC GCGAGGCGTG GCCTTATCCG CTGGCGGTCG CGATTGGCGA TGGTGTAACG
GCCACCGTGC CGCTGCTGTG GGTAGTGATG ATTTGCGCCA CCTTTGCCCG CCCGAACGGC
TTGTTTATCG CTCATTTTGG CTGGCCGCGC GAACGTGTTT CCCGTGGGAT GCGCTACTAC
CTGATGAGCA TCGGGCTTAT TGTGCCGCTG ATTATGGCGC TGATGATGTT CGATAACCTC
GACGACCGTG AATTCTCCGG TTCGCTGGGA CGGCTTTGCT TTATCCTCAT TTGCGGTGCG
CTGGCGGTGG TCACCCTCAG CCTGAAAAAG GCCGGGATTC CGCTGTATCT CAACAAAGAG
GGCAGCGGCG ACAACATTAC CAACCATATG CTGTGGAACA TGATGATTGG CGCGCCGTTG
GTTGCCATTC TGGCGTCGGC GGTGGGTTAT CTGGCAACGG CACAGGCGCT GTTAGCGAGG
CTTGAAACCT CGGTTGCCAT CTGGTTCCTG CTACTGGTGG TTTATCACGT TATCCGCCGC
TGGATGCTGA TCCAGCGCCG CAGGCTGGCG TTTGATCGGG CGAAGCATCG CCGGGCAGAG
ATGTTAGCGC AACGTGCGCG TGGCGAAGAG GAAGCGCATC ATCACAGTAG CCCGGAAGGA
GCAATTGAAG TCGATGAAAG CGAAGTCGAT CTCGATGCCA TCAGTGCGCA ATCCTTGCGG
CTGGTGCGCT CAATTTTGAT GTTGATCGCC CTGCTTTCTG TCATTGTGCT GTGGTCAGAA
ATCCATTCCG CTTTCGGCTT CCTCGAAAAT ATTTCGCTGT GGGATGTCAC CTCCACGGTA
CAGGGCGTAG AAAGTCTGGA GCCAATTACC CTCGGTGCGG TGCTGATTGC CATTCTGGTG
TTTATCATCA CCACGCAGCT GGTGCGCAAC TTGCCCGCGC TGCTGGAACT GGCGATTTTG
CAGCACCTGG ATTTAACGCC GGGTACGGGT TACGCCATCA CCACCATCAC CAAATATCTG
CTGATGCTGA TTGGCGGGCT GGTCGGCTTC TCAATGATTG GTATTGAGTG GTCGAAATTG
CAGTGGCTGG TTGCCGCGCT CGGTGTTGGT CTCGGTTTTG GTTTGCAGGA AATTTTCGCC
AACTTTATCT CTGGTCTGAT TATCCTGTTC GAAAAACCGA TTCGCATTGG CGATACGGTG
ACAATTCGCG ATCTCACCGG TAGCGTGACG AAAATTAACA CCCGCGCCAC CACCATCAGC
GACTGGGACC GTAAAGAGAT AATCGTGCCG AACAAGGCGT TTATTACCGA GCAGTTTATC
AACTGGTCGC TCTCTGACTC GGTCACGCGC GTGGTGTTGA CGATACCGGC CCCTGCCGAT
GCCAATAGCG AAGAAGTGAC GGAAATCCTG CTCACCGCAG CGCGTCGCTG CTCGCTGGTG
ATCGACAACC CGGCACCGGA AGTCTTCCTG GTGGATCTGC AACAGGGGAT TCAGATTTTC
GAGCTGCGTA TTTACGCCGC TGAGATGGGT CACCGTATGC CGCTACGCCA TGAGATCCAC
CAGCTGATTC TGGCTGGCTT CCATGCCCAC GGTATCGATA TGCCATTCCC GCCCTTCCAG
ATGCGTCTGG AAAGCCTCAA CGGTAAACAA ACGGGGAGAA CGCTGACGTC TGCGGGCAAA
GGTCGTCAGG CGGGAAGTTT GTAA
 
Protein sequence
MRLIITFLMA WCLSWGAYAA TAPDSKQITQ ELEQAKAAKP AQPEVVEALQ SALNALEERK 
GSLERIKQYQ QVIDNYPKLS ATLRAQLNNM RDEPRSVSPG MSTDALNQEI LQVSSQLLDK
SRQAQQEQER AREIADSLNQ LPQQQTDARR QLNEIERRLG TLTGNTPLNQ AQNFALQSDS
ARLKALVDEL ELAQLSANNR QELARLRSEL AEKESQQLDA YLQALRNQLN SQRQLEAERA
LESTELLAEN SADLPKDIVA QFKINRELSA ALNQQAQRMD LIASQQRQAA SQTLQVRQAL
NTLREQSQWL GSSNLLGEAL RAQVARLPEM PKPQQLDTEM AQLRVQRLRY EDLLNKQPLL
RQIHQADGQP LTAEQNRILE AQLRTQRELL NSLLQGGDTL LLELTKLKVS NGQLEDALKE
VNEATHRYLF WTSDVRPMTI AWPLEIAQDL RRLISLDTFS QLGKASVMML TSKETILPLF
GALILVGCSI YSRRYFTRFL ERSAAKVGKV TQDHFWLTLR TLFWSILVAS PLPVLWMTLG
YGLREAWPYP LAVAIGDGVT ATVPLLWVVM ICATFARPNG LFIAHFGWPR ERVSRGMRYY
LMSIGLIVPL IMALMMFDNL DDREFSGSLG RLCFILICGA LAVVTLSLKK AGIPLYLNKE
GSGDNITNHM LWNMMIGAPL VAILASAVGY LATAQALLAR LETSVAIWFL LLVVYHVIRR
WMLIQRRRLA FDRAKHRRAE MLAQRARGEE EAHHHSSPEG AIEVDESEVD LDAISAQSLR
LVRSILMLIA LLSVIVLWSE IHSAFGFLEN ISLWDVTSTV QGVESLEPIT LGAVLIAILV
FIITTQLVRN LPALLELAIL QHLDLTPGTG YAITTITKYL LMLIGGLVGF SMIGIEWSKL
QWLVAALGVG LGFGLQEIFA NFISGLIILF EKPIRIGDTV TIRDLTGSVT KINTRATTIS
DWDRKEIIVP NKAFITEQFI NWSLSDSVTR VVLTIPAPAD ANSEEVTEIL LTAARRCSLV
IDNPAPEVFL VDLQQGIQIF ELRIYAAEMG HRMPLRHEIH QLILAGFHAH GIDMPFPPFQ
MRLESLNGKQ TGRTLTSAGK GRQAGSL