Gene Information |
Locus tag | EcSMS35_3699 |
Symbol | malT |
ID | 6143219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3760617 |
End bp | 3763322 |
Gene Length | 2706 bp |
Protein Length | 901 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618526 |
Product | transcriptional regulator MalT |
Protein accession | YP_001745666 |
Protein GI | 170683264 |
COG category | [K] Transcription |
COG ID | [COG2909] ATP-dependent transcriptional regulator |
TIGRFAM ID | |
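
The length fields in this table agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the protein length excludes the terminal stop codon. Both readings are assumptions; the record does not state them. A minimal Python check under those assumptions:

```python
# Values copied from the Gene Information table.
start_bp = 3760617
end_bp = 3763322

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

With these inputs the computed values match the Gene Length (2706 bp) and Protein Length (901 aa) fields.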
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.203267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.15955 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCTGATTC CGTCAAAATT AAGTCGTCCG GTTCGACTCG ACCATACCGT GGTTCGTGAG CGCCTGCTGG CTAAACTTTC CGGCGCGAAC AACTTCCGGC TGGCGCTGAT CACAAGTCCT GCGGGCTACG GAAAGACCAC GCTCATTTCC CAGTGGGCGG CAGGCAAAAA CGATATCGGC TGGTACTCGC TGGATGAAGG TGATAACCAG CAAGAGCGTT TCGCCAGCTA TCTCATTGCC GCCGTGCAGC AGGCAACCAA CGGTCACTGT GCGATATGTG AGACAATGGC GCAAAAACGG CAATATGCCA GCCTGACGTC ACTCTTCGCC CAGCTTTTCA TTGAGCTGGC GGAATGGCAT AGCCCACTTT ATCTGGTCAT CGATGACTAT CATCTGATCA CTAATCCAGT GATCCACGAG TCAATGCGCT TCTTTATTCG CCATCAACCA GAAAATCTCA CCCTGGTGGT GTTGTCACGC AACCTTCCGC AACTGGGCAT TGCCAATCTG CGTGTTCGTG ACCAACTGCT GGAAATTGGC AGTCAGCAAC TGGCATTTAC CCATCAGGAA GCGAAGCAGT TTTTTGATTG CCGTCTGTCA TCGCCGATTG AAGCCGCAGA AAGCAGTCGG ATTTGTGATG ACGTTTCCGG TTGGGCGACG GCACTGCAGC TAATCGCCCT CTCCGCCCGG CAGAATACCC ACTCAGCCCA TAAGTCGGCA CGCCGCCTGG CGGGAATCAA TGCCAGCCAT CTTTCGGATT ATCTGGTCGA TGAGGTTTTG GATAACGTCG ATCTCGCAAC GCGCCATTTT CTGTTGAAAA GCGCCATTTT GCGCTCAATG AACGACGCAC TCATCAACCG TGTGACCGGC GAAGAAAACG GGCAAATGCG TCTCGAAGAG ATTGAGCGCC AGGGGCTGTT TTTACAGCGA ATGGATGATA CCGGCGAGTG GTTCTGCTAT CACCCGCTGT TTGGTAACTT CCTGCGTCAG CGCTGCCAGT GGGAACTGGC GGCGGAGCTG CCGGAAATCC ACCGTGCCGC CGCAGAAAGC TGGATGGCCC AGGGATTCCC CAGCGAAGCG ATTCATCATG CGCTGGCGGC AGGCGATGCG CTGATGCTGC GCGATATTCT GCTTAATCAC GCCTGGAGTC TGTTCAACCA TAGCGAACTG TCGCTGCTGG AAGAGTCGCT TAAGGCCCTG CCGTGGGACA GTTTGCTGGA AAATCCGCAG TTAGTGTTAT TGCAGGCGTG GCTGATGCAA AGCCAACATC GCTACGGCGA AGTTAACACC CTGCTAGCCC GTGCTGAACA TGAAATCAAG GACATCAGAG AAGGCACCAT GCACGCAGAA TTTAACGCTC TGCGCGCCCA GGTGGCGATT AACGATGGTA ACCCGGATGA AGCGGAACGG CTGGCAAAAC TGGCACTGGA AGAGCTGCCG CCGGGCTGGT TCTATAGCCG CATTGTAGCA ACCTCAGTGC TGGGTGAAGT ATTGCACTGC AAAGGCGAAT TGACCCGCTC ACTGGCGTTA ATGCAGCAAA CCGAACAGAT GGCACGCCAG CACGATGTCT GGCACTACGC CTTATGGAGT TTAATCCAGC AAAGTGAAAT TCTGTTTGCT CAGGGATTCC TGCAAACCGC GTGGGAAACG CAGGAAAAAG CGTTCCAGCT GATCAACGAG CAGCATCTGG AACAGCTGCC AATGCATGAG TTTCTGGTGC GCATTCGTGC GCAGCTGTTA TGGGCCTGGG CACGGCTGGA TGAAGCCGAA GCGTCAGCAC GTAGCGGGAT TGAAGTCTTG 
TCGTCTTATC AGCCACAGCA ACAGCTTCAG TGCCTGGCAA TGTTGATTCA ATGCTCGCTG GCCCGTGGTG ATTTAGATAA CGCCCGTAGC CAGCTTAACC GTCTGGAAAA CCTGCTGGGG AATGGCAAAT ATCACAGCGA CTGGATCTCT AACGCCAACA AAGTCCGGGT GATTTACTGG CAAATGACCG GCGATAAAGC CGCCGCTGCC AACTGGTTGC GTCATACGGC TAAACCGGAG TTTGCGAACA ACCACTTCCT GCAAGGTCAA TGGCGCAACA TTGCCCGTGC ACAAATCTTG CTGGGCGAGT TTGAACCGGC AGAAATTGTT CTCGAAGAAC TCAATGAAAA TGCCCGCAGT CTGCGGTTGA TGAGCGATCT CAACCGTAAC CTGTTGCTGC TTAATCAACT GTACTGGCAG GCCGGACGTA AAAGTGACGC CCAGCGTGTG TTGCTGGACG CATTAAAACT GGCGAATCGC ACCGGATTTA TCAGTCATTT TGTCATCGAA GGCGAAGCGA TGGCGCAACA ACTGCGCCAG TTGATTCAGC TTAATACGCT GCCGGAACTG GAACAGCATC GTGCGCAGCG TATTCTGCGA GAAATCAATC AACATCATCG GCATAAATTC GCCCATTTCG ATGAGAATTT CGTTGAACGT CTGCTAAATC ATCCGGAAGT GCCAGAACTG ATCCGCACCA GCCCGCTGAC CCAACGTGAA TGGCAGGTCC TGGGGCTGAT TTACTCCGGT TACAGCAACG AGCAAATTGC AGGCGAGCTG GAAGTGGCGG CAACCACCAT CAAAACGCAT ATCCGCAATC TGTACCAAAA ACTCGGTGTC GCCCACCGTC AGGCCGCGGT ACAACACGCG CAGAAACTGC TGAAGATGAT GGGATATGGG GTATGA
Protein sequence | MLIPSKLSRP VRLDHTVVRE RLLAKLSGAN NFRLALITSP AGYGKTTLIS QWAAGKNDIG WYSLDEGDNQ QERFASYLIA AVQQATNGHC AICETMAQKR QYASLTSLFA QLFIELAEWH SPLYLVIDDY HLITNPVIHE SMRFFIRHQP ENLTLVVLSR NLPQLGIANL RVRDQLLEIG SQQLAFTHQE AKQFFDCRLS SPIEAAESSR ICDDVSGWAT ALQLIALSAR QNTHSAHKSA RRLAGINASH LSDYLVDEVL DNVDLATRHF LLKSAILRSM NDALINRVTG EENGQMRLEE IERQGLFLQR MDDTGEWFCY HPLFGNFLRQ RCQWELAAEL PEIHRAAAES WMAQGFPSEA IHHALAAGDA LMLRDILLNH AWSLFNHSEL SLLEESLKAL PWDSLLENPQ LVLLQAWLMQ SQHRYGEVNT LLARAEHEIK DIREGTMHAE FNALRAQVAI NDGNPDEAER LAKLALEELP PGWFYSRIVA TSVLGEVLHC KGELTRSLAL MQQTEQMARQ HDVWHYALWS LIQQSEILFA QGFLQTAWET QEKAFQLINE QHLEQLPMHE FLVRIRAQLL WAWARLDEAE ASARSGIEVL SSYQPQQQLQ CLAMLIQCSL ARGDLDNARS QLNRLENLLG NGKYHSDWIS NANKVRVIYW QMTGDKAAAA NWLRHTAKPE FANNHFLQGQ WRNIARAQIL LGEFEPAEIV LEELNENARS LRLMSDLNRN LLLLNQLYWQ AGRKSDAQRV LLDALKLANR TGFISHFVIE GEAMAQQLRQ LIQLNTLPEL EQHRAQRILR EINQHHRHKF AHFDENFVER LLNHPEVPEL IRTSPLTQRE WQVLGLIYSG YSNEQIAGEL EVAATTIKTH IRNLYQKLGV AHRQAAVQHA QKLLKMMGYG V
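
Both sequences are printed in 10-character blocks separated by spaces. The sketch below, in plain Python with no external library, shows one way to strip that grouping and compute a GC fraction for comparison with the GC content field. The function names `clean_sequence` and `gc_fraction` are illustrative and not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, used here only as a short example.
example = clean_sequence("ATGCTGATTC CGTCAAAATT")
print(len(example), gc_fraction(example))
```

Applying the same two functions to the full gene sequence should give a length equal to the Gene Length field, assuming the displayed sequence is complete.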