Gene EcSMS35_3699 details


Gene Information

Locus tag: EcSMS35_3699
Symbol: malT
ID: 6143219
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3760617
End bp: 3763322
Gene Length: 2706 bp
Protein Length: 901 aa
Translation table: 11
GC content: 54%
IMG OID: 641618526
Product: transcriptional regulator MalT
Protein accession: YP_001745666
Protein GI: 170683264
COG category: [K] Transcription
COG ID: [COG2909] ATP-dependent transcriptional regulator
TIGRFAM ID:
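
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted as a residue. The helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(3760617, 3763322)
print(length, protein_length(length))  # prints: 2706 901
```

Under those assumptions, both derived values agree with the Gene Length and Protein Length fields of the record.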


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.203267
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.15955
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGATTC CGTCAAAATT AAGTCGTCCG GTTCGACTCG ACCATACCGT GGTTCGTGAG 
CGCCTGCTGG CTAAACTTTC CGGCGCGAAC AACTTCCGGC TGGCGCTGAT CACAAGTCCT
GCGGGCTACG GAAAGACCAC GCTCATTTCC CAGTGGGCGG CAGGCAAAAA CGATATCGGC
TGGTACTCGC TGGATGAAGG TGATAACCAG CAAGAGCGTT TCGCCAGCTA TCTCATTGCC
GCCGTGCAGC AGGCAACCAA CGGTCACTGT GCGATATGTG AGACAATGGC GCAAAAACGG
CAATATGCCA GCCTGACGTC ACTCTTCGCC CAGCTTTTCA TTGAGCTGGC GGAATGGCAT
AGCCCACTTT ATCTGGTCAT CGATGACTAT CATCTGATCA CTAATCCAGT GATCCACGAG
TCAATGCGCT TCTTTATTCG CCATCAACCA GAAAATCTCA CCCTGGTGGT GTTGTCACGC
AACCTTCCGC AACTGGGCAT TGCCAATCTG CGTGTTCGTG ACCAACTGCT GGAAATTGGC
AGTCAGCAAC TGGCATTTAC CCATCAGGAA GCGAAGCAGT TTTTTGATTG CCGTCTGTCA
TCGCCGATTG AAGCCGCAGA AAGCAGTCGG ATTTGTGATG ACGTTTCCGG TTGGGCGACG
GCACTGCAGC TAATCGCCCT CTCCGCCCGG CAGAATACCC ACTCAGCCCA TAAGTCGGCA
CGCCGCCTGG CGGGAATCAA TGCCAGCCAT CTTTCGGATT ATCTGGTCGA TGAGGTTTTG
GATAACGTCG ATCTCGCAAC GCGCCATTTT CTGTTGAAAA GCGCCATTTT GCGCTCAATG
AACGACGCAC TCATCAACCG TGTGACCGGC GAAGAAAACG GGCAAATGCG TCTCGAAGAG
ATTGAGCGCC AGGGGCTGTT TTTACAGCGA ATGGATGATA CCGGCGAGTG GTTCTGCTAT
CACCCGCTGT TTGGTAACTT CCTGCGTCAG CGCTGCCAGT GGGAACTGGC GGCGGAGCTG
CCGGAAATCC ACCGTGCCGC CGCAGAAAGC TGGATGGCCC AGGGATTCCC CAGCGAAGCG
ATTCATCATG CGCTGGCGGC AGGCGATGCG CTGATGCTGC GCGATATTCT GCTTAATCAC
GCCTGGAGTC TGTTCAACCA TAGCGAACTG TCGCTGCTGG AAGAGTCGCT TAAGGCCCTG
CCGTGGGACA GTTTGCTGGA AAATCCGCAG TTAGTGTTAT TGCAGGCGTG GCTGATGCAA
AGCCAACATC GCTACGGCGA AGTTAACACC CTGCTAGCCC GTGCTGAACA TGAAATCAAG
GACATCAGAG AAGGCACCAT GCACGCAGAA TTTAACGCTC TGCGCGCCCA GGTGGCGATT
AACGATGGTA ACCCGGATGA AGCGGAACGG CTGGCAAAAC TGGCACTGGA AGAGCTGCCG
CCGGGCTGGT TCTATAGCCG CATTGTAGCA ACCTCAGTGC TGGGTGAAGT ATTGCACTGC
AAAGGCGAAT TGACCCGCTC ACTGGCGTTA ATGCAGCAAA CCGAACAGAT GGCACGCCAG
CACGATGTCT GGCACTACGC CTTATGGAGT TTAATCCAGC AAAGTGAAAT TCTGTTTGCT
CAGGGATTCC TGCAAACCGC GTGGGAAACG CAGGAAAAAG CGTTCCAGCT GATCAACGAG
CAGCATCTGG AACAGCTGCC AATGCATGAG TTTCTGGTGC GCATTCGTGC GCAGCTGTTA
TGGGCCTGGG CACGGCTGGA TGAAGCCGAA GCGTCAGCAC GTAGCGGGAT TGAAGTCTTG
TCGTCTTATC AGCCACAGCA ACAGCTTCAG TGCCTGGCAA TGTTGATTCA ATGCTCGCTG
GCCCGTGGTG ATTTAGATAA CGCCCGTAGC CAGCTTAACC GTCTGGAAAA CCTGCTGGGG
AATGGCAAAT ATCACAGCGA CTGGATCTCT AACGCCAACA AAGTCCGGGT GATTTACTGG
CAAATGACCG GCGATAAAGC CGCCGCTGCC AACTGGTTGC GTCATACGGC TAAACCGGAG
TTTGCGAACA ACCACTTCCT GCAAGGTCAA TGGCGCAACA TTGCCCGTGC ACAAATCTTG
CTGGGCGAGT TTGAACCGGC AGAAATTGTT CTCGAAGAAC TCAATGAAAA TGCCCGCAGT
CTGCGGTTGA TGAGCGATCT CAACCGTAAC CTGTTGCTGC TTAATCAACT GTACTGGCAG
GCCGGACGTA AAAGTGACGC CCAGCGTGTG TTGCTGGACG CATTAAAACT GGCGAATCGC
ACCGGATTTA TCAGTCATTT TGTCATCGAA GGCGAAGCGA TGGCGCAACA ACTGCGCCAG
TTGATTCAGC TTAATACGCT GCCGGAACTG GAACAGCATC GTGCGCAGCG TATTCTGCGA
GAAATCAATC AACATCATCG GCATAAATTC GCCCATTTCG ATGAGAATTT CGTTGAACGT
CTGCTAAATC ATCCGGAAGT GCCAGAACTG ATCCGCACCA GCCCGCTGAC CCAACGTGAA
TGGCAGGTCC TGGGGCTGAT TTACTCCGGT TACAGCAACG AGCAAATTGC AGGCGAGCTG
GAAGTGGCGG CAACCACCAT CAAAACGCAT ATCCGCAATC TGTACCAAAA ACTCGGTGTC
GCCCACCGTC AGGCCGCGGT ACAACACGCG CAGAAACTGC TGAAGATGAT GGGATATGGG
GTATGA
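
The gene sequence is laid out in blocks of 10 bases, so it needs to be joined into one string before any analysis. The following is a minimal sketch of that step plus a GC-content calculation. The function names are hypothetical, and only the first row of the sequence is used as sample input, so the printed percentage describes that row and not the whole gene.

```python
def normalize_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGCTGATTC CGTCAAAATT AAGTCGTCCG GTTCGACTCG ACCATACCGT GGTTCGTGAG"
seq = normalize_sequence(first_row)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions over the full gene sequence is one way to check the Gene Length and GC content fields of the record.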
 
Protein sequence
MLIPSKLSRP VRLDHTVVRE RLLAKLSGAN NFRLALITSP AGYGKTTLIS QWAAGKNDIG 
WYSLDEGDNQ QERFASYLIA AVQQATNGHC AICETMAQKR QYASLTSLFA QLFIELAEWH
SPLYLVIDDY HLITNPVIHE SMRFFIRHQP ENLTLVVLSR NLPQLGIANL RVRDQLLEIG
SQQLAFTHQE AKQFFDCRLS SPIEAAESSR ICDDVSGWAT ALQLIALSAR QNTHSAHKSA
RRLAGINASH LSDYLVDEVL DNVDLATRHF LLKSAILRSM NDALINRVTG EENGQMRLEE
IERQGLFLQR MDDTGEWFCY HPLFGNFLRQ RCQWELAAEL PEIHRAAAES WMAQGFPSEA
IHHALAAGDA LMLRDILLNH AWSLFNHSEL SLLEESLKAL PWDSLLENPQ LVLLQAWLMQ
SQHRYGEVNT LLARAEHEIK DIREGTMHAE FNALRAQVAI NDGNPDEAER LAKLALEELP
PGWFYSRIVA TSVLGEVLHC KGELTRSLAL MQQTEQMARQ HDVWHYALWS LIQQSEILFA
QGFLQTAWET QEKAFQLINE QHLEQLPMHE FLVRIRAQLL WAWARLDEAE ASARSGIEVL
SSYQPQQQLQ CLAMLIQCSL ARGDLDNARS QLNRLENLLG NGKYHSDWIS NANKVRVIYW
QMTGDKAAAA NWLRHTAKPE FANNHFLQGQ WRNIARAQIL LGEFEPAEIV LEELNENARS
LRLMSDLNRN LLLLNQLYWQ AGRKSDAQRV LLDALKLANR TGFISHFVIE GEAMAQQLRQ
LIQLNTLPEL EQHRAQRILR EINQHHRHKF AHFDENFVER LLNHPEVPEL IRTSPLTQRE
WQVLGLIYSG YSNEQIAGEL EVAATTIKTH IRNLYQKLGV AHRQAAVQHA QKLLKMMGYG
V
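
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below shows one way to do this, assuming the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may serve as start codons, and that is not modeled here). The `translate` helper is hypothetical, not a library function.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string from its first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCTGATTC CGTCAAAATT AAGTCGTCCG"))  # prints: MLIPSKLSRP
```

The result matches the first 10 residues of the protein sequence, and translating the final six bases of the gene (`GTATGA`) gives `V` followed by a stop, matching the last residue.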