Gene ECD_04148 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_04148 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp4419032 
End bp4421035 
Gene Length2004 bp 
Protein Length667 aa 
Translation table11 
GC content50% 
IMG OID 
Productsecondary glycine betaine transporter BetU 
Protein accessionACT45933 
Protein GI253980263 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTATGA GTGAAAATGA TACAATCCCA AAGAAGTCTA CAAGTCAGAT TAACAAAGCG 
GTATTCTTTA CATCTGCTTT GCTAATTTTC CTTCTTGTCG CCTTTGCCGC CGTATTCCCG
GATGTCGCCG ACAAAAATTT TAAACTACTT CAGCAACAAA TCTTCACGAA TGCCAGCTGG
TTCTACATCC TTGCTGTGGC CCTGATTTTA CTGAGTGTCA CGTTCCTTGG ACTCTCACGC
TACGGTGATA TCAAGCTGGG CCCGGACCAT GCGCAGCCTG ATTTCAGCTA CCACTCCTGG
TTTGCGATGC TTTTTTCGGC AGGGATGGGG ATCGGCCTGA TGTTCTTTGG CGTTGCCGAA
CCTGTAATGC ATTATCTTTC GCCACCCGTT GGCACTCCAG AAACCGTTGC GGCAGCCAAG
GAAGCAATGC GTCTGACCTT TTTCCACTGG GGACTGCACG CATGGGCAAT TTATGCCATT
GTGGCGCTGA TTCTGGCGTT CTTCAGTTAC CGTCACGGTC TGCCTTTAAC TCTGCGCTCC
GCACTCTATC CCATTATTGG CGATCGCATA TACGGACCTG TAGGACATGC GGTTGATATT
TTCGCTGTTA TAGGCACGGT CTTTGGCGTT GCGACATCAC TGGGTTACGG TGTTTTGCAG
GTGAATGCCG GTTTGAACCA TCTTTTCGGG GTGCCCATCA ATGAAACGGT GCAGGTAATT
CTGATCGTGG TCATCACGGG GTTAGCGACG ATTTCAGTGG TGTCCGGTCT GGATAAGGGA
ATACGTATCC TGTCTGAACT CAATCTGGGT CTGGCTTTGT TGCTCCTGGC GCTGGTCCTG
TGTCTGGGAC CAACCGTGCT TCTGCTGAAG TCATTTGTGG AAAATACGGG CGGTTATCTT
TCGGAACTGG TGAGTAAAAC GTTCAACCTT TACGCGTATG AGCCCAAGTC GAGCAACTGG
CTGGGGGGCT GGACATTACT GTACTGGGGA TGGTGGCTTT CATGGTCGCC GTTTGTGGGG
ATGTTCATCG CACGGGTCTC CCGCGGGCGA ACCATTCGCG AGTTTGTCAC CGGCGTGCTG
TTTGTTCCCG CGGGTTTTAC GCTAATGTGG ATGACGGTGT TTGGTAACAG CGCGATCTAT
CTCATTATGA ACCAGGGGGC CACAGACCTC GCCAATACTG TTCAGCAGGA TGTGTCGCTG
GCCCTGTTTA ATTTCCTGGA GCATTTCCCG TTCTCTTCTG TGCTGTCATT CATTGCAATG
GCGATGGTCA TCGTCTTCTT TGTAACGTCT GCTGATTCGG GGGCAATGGT TGTGGATACT
CTGGCATCAG GTGGAGTGGC AAACACACCC GTCTGGCAGC GAATATTCTG GGCCTCGCTC
ATGGGCATTG TTGCAATTGC GCTTCTCCTT GCCGGAGGGC TAAGTGCGCT GCAAACGGTG
ACAATAGCGA GTGCATTGCC CTTCTCAGTG ATCTTACTAA TATCCATATA CGGACTTTTA
AAAGCTTTGC GCCGGGATTT GACCAAGCGT GAAAGCCTGA GCATGGCGAC AATTGCTCCT
ACGGCTGCAC GTAACCCAAT TCCTTGGCAG AGAAGGTTAC GCAATATCGC GTATCTGCCG
AAGCGATCTC TTGTGAAACG TTTTATGGAC GACGTTATCC AGCCCGCCAT GACGCTGGTT
CAGGAGGAAC TGAACAAGCA GGGGACGATA AGCCACATTA GTGATGCAGT CGACGATCGT
ATTCGTCTTG AAGTCGATTT GGGCAACGAG CTGAATTTTA TATATGAAGT GAGGCTTCGC
GGGTATATCT CACCGACCTT CGCGCTCGCC GCAATGGATA ATGATGAGCA GCAGACTGAA
CAACATCGAT ATTATCGCGC TGAGGTTTAT CTCAAAGAAG GCGGTCAAAA TTATGATGTG
ATGGGCTGGA ACCAGGAACA GCTGATTAAT GACATACTGG ACCAGTACGA AAAACACCTG
CACTTCCTGC ACCTGGTTCG TTAA
 
Protein sequence
MIMSENDTIP KKSTSQINKA VFFTSALLIF LLVAFAAVFP DVADKNFKLL QQQIFTNASW 
FYILAVALIL LSVTFLGLSR YGDIKLGPDH AQPDFSYHSW FAMLFSAGMG IGLMFFGVAE
PVMHYLSPPV GTPETVAAAK EAMRLTFFHW GLHAWAIYAI VALILAFFSY RHGLPLTLRS
ALYPIIGDRI YGPVGHAVDI FAVIGTVFGV ATSLGYGVLQ VNAGLNHLFG VPINETVQVI
LIVVITGLAT ISVVSGLDKG IRILSELNLG LALLLLALVL CLGPTVLLLK SFVENTGGYL
SELVSKTFNL YAYEPKSSNW LGGWTLLYWG WWLSWSPFVG MFIARVSRGR TIREFVTGVL
FVPAGFTLMW MTVFGNSAIY LIMNQGATDL ANTVQQDVSL ALFNFLEHFP FSSVLSFIAM
AMVIVFFVTS ADSGAMVVDT LASGGVANTP VWQRIFWASL MGIVAIALLL AGGLSALQTV
TIASALPFSV ILLISIYGLL KALRRDLTKR ESLSMATIAP TAARNPIPWQ RRLRNIAYLP
KRSLVKRFMD DVIQPAMTLV QEELNKQGTI SHISDAVDDR IRLEVDLGNE LNFIYEVRLR
GYISPTFALA AMDNDEQQTE QHRYYRAEVY LKEGGQNYDV MGWNQEQLIN DILDQYEKHL
HFLHLVR