Gene ECD_03991 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_03991 
SymbolmelB 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp4250460 
End bp4251869 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content46% 
IMG OID 
Productmelibiose:sodium symporter 
Protein accessionACT45781 
Protein GI253980111 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTACAA AACTCAGTTA TGGATTTGGA GCGTTCGGGA AGGATTTTGC GATCGGCATT 
GTGTATATGT ACCTCATGTA TTACTACACC GATGTCGTCG GGCTGTCTGT GGGTTTGGTC
GGTACTTTGT TTCTGGTGGC GAGGATCTGG GATGCTATTA ACGATCCGAT TATGGGATGG
ATTGTAAATG CTACGCGATC GCGATGGGGT AAGTTCAAAC CCTGGATCCT GATCGGTACG
TTGGCAAACT CTGTAATCTT ATTTCTCCTC TTTAGTGCGC ATCTGTTTGA AGGTACTACT
CAGATTGTCT TTGTTTGCGT GACCTACATC CTCTGGGGCA TGACTTACAC CATTATGGAT
ATTCCCTTCT GGTCGCTGGT TCCAACCATA ACGCTCGATA AACGTGAGCG CGAACAACTG
GTTCCTTATC CGCGTTTTTT TGCCAGTCTG GCAGGCTTTG TTACGGCAGG TGTGACGCTA
CCATTTGTTA ATTATGTCGG CGGTGGCGAT CGGGGATTTG GCTTTCAGAT GTTCACTCTG
GTACTGATCG CCTTTTTTAT TGTTTCAACC ATCATCACTC TGCGCAATGT GCATGAAGTC
TTTTCGTCAG ACAATCAACC GTCTGCTGAA GGAAGCCATC TGACACTTAA AGCCATCGTT
GCGCTAATTT ATAAAAACGA TCAGCTTTCA TGCCTCTTGG GTATGGCTCT TGCTTATAAT
GTAGCCAGCA ACATTATTAC CGGCTTTGCT ATCTATTATT TCTCATATGT TATCGGTGAT
GCGGATTTGT TCCCCTATTA TCTGTCGTAT GCGGGAGCTG CTAACCTGGT GACGTTAGTA
TTCTTCCCAC GCTTAGTTAA ATCATTATCC CGACGCATTT TATGGGCCGG AGCATCTATT
CTTCCGGTGT TAAGCTGTGG TGTTCTCCTG TTAATGGCAT TAATGAGCTA TCACAACGTC
GTCCTCATTG TGATTGCGGG TATTTTGCTG AATGTGGGAA CGGCGCTTTT CTGGGTATTA
CAGGTCATCA TGGTGGCAGA TACCGTTGAT TACGGTGAAT ATAAACTGCA CGTACGCTGT
GAAAGTATCG CTTACTCCGT GCAGACTATG GTGGTGAAGG GCGGTTCAGC CTTTGCGGCT
TTTTTCATTG CGGTTGTGTT AGGGATGATT GGCTATGTAC CGAATGTTGA ACAGTCTACG
CAAGCCCTAT TAGGTATGCA GTTTATTATG ATTGCTCTAC CAACTCTGTT TTTCATGGTA
ACGCTGATTC TCTACTTCCG TTTCTATCGC CTCAATGGTG ACACGCTGCG CAGGATCCAG
ATCCATCTGC TGGATAAATA TCGCAAAGTA CCGCCCGAGC CTGTTCATGC TGATATTCCG
GTCGGTGCAG TGAGTGATGT GAAAGCCTGA
 
Protein sequence
MTTKLSYGFG AFGKDFAIGI VYMYLMYYYT DVVGLSVGLV GTLFLVARIW DAINDPIMGW 
IVNATRSRWG KFKPWILIGT LANSVILFLL FSAHLFEGTT QIVFVCVTYI LWGMTYTIMD
IPFWSLVPTI TLDKREREQL VPYPRFFASL AGFVTAGVTL PFVNYVGGGD RGFGFQMFTL
VLIAFFIVST IITLRNVHEV FSSDNQPSAE GSHLTLKAIV ALIYKNDQLS CLLGMALAYN
VASNIITGFA IYYFSYVIGD ADLFPYYLSY AGAANLVTLV FFPRLVKSLS RRILWAGASI
LPVLSCGVLL LMALMSYHNV VLIVIAGILL NVGTALFWVL QVIMVADTVD YGEYKLHVRC
ESIAYSVQTM VVKGGSAFAA FFIAVVLGMI GYVPNVEQST QALLGMQFIM IALPTLFFMV
TLILYFRFYR LNGDTLRRIQ IHLLDKYRKV PPEPVHADIP VGAVSDVKA