Gene ECD_02533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02533 
SymbolproV 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2644674 
End bp2645876 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content49% 
IMG OID 
Productglycine betaine transporter subunit 
Protein accessionACT44352 
Protein GI253978682 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.962653 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATTA AATTAGAAAT TAAAAATCTT TATAAAATAT TTGGCGAGCA TCCACAGCGA 
GCGTTCAAAT ATATCGAACA AGGACTTTCA AAAGAACAAA TTCTGGAAAA AACTGGGCTA
TCGCTTGGCG TAAAAGACGC CAGTCTGGCC ATTGAAGAAG GCGAGATATT TGTCATCATG
GGATTATCCG GCTCGGGTAA ATCCACAATG GTACGCCTTC TCAATCGCCT GATTGAACCC
ACCCGCGGGC AAGTGCTGAT TGATGGTGTG GATATTGCCA AAATATCCGA CGCCGAACTC
CGTGAGGTGC GCAGAAAAAA GATTGCGATG GTCTTCCAGT CCTTTGCCTT AATGCCGCAT
ATGACCGTGC TGGACAATAC TGCGTTCGGT ATGGAATTGG CCGGAATTAA TGCCGAAGAA
CGCCGGGAAA AAGCCCTTGA TGCACTGCGT CAGGTCGGGC TGGAAAATTA TGCCCACAGC
TACCCGGATG AACTCTCTGG CGGGATGCGT CAACGTGTGG GATTAGCCCG CGCGTTAGCG
ATTAATCCGG ATATATTATT AATGGACGAA GCCTTCTCGG CGCTCGATCC ATTAATTCGC
ACCGAGATGC AGGATGAGCT GGTAAAATTA CAGGCGAAAC ATCAGCGCAC CATTGTCTTT
ATTTCCCACG ATCTTGATGA AGCCATGCGT ATTGGCGACC GAATTGCCAT TATGCAAAAT
GGTGAAGTGG TACAGGTCGG CACACCGGAT GAAATTCTCA ATAATCCGGC GAATGATTAT
GTCCGTACCT TCTTCCGTGG CGTTGATATT AGTCAGGTAT TCAGTGCGAA AGATATTGCC
CGCCGGACAC CGAATGGCTT AATTCGTAAA ACCCCTGGCT TCGGCCCACG TTCGGCACTG
AAATTATTGC AGGATGAAGA TCGCGAATAT GGCTACGTTA TCGAACGCGG TAATAAGTTT
GTCGGCGCAG TCTCCATCGA TTCGCTTAAA ACCGCGTTAA CGCAGCAGCA AGGTCTTGAT
GCGGCGCTGA TTGATGCGCC GTTAGCAGTC GATGCACAAA CGCCTCTTAG CGAGTTGCTC
TCTCATGTCG GACAGGCACC CTGTGCGGTG CCCGTGGTCG ACGAGGACCA ACAGTATGTC
GGCATCATTT CGAAAGGAAT GCTGCTGCGC GCTTTAGATC GTGAGGGGGT AAATAATGGC
TGA
 
Protein sequence
MAIKLEIKNL YKIFGEHPQR AFKYIEQGLS KEQILEKTGL SLGVKDASLA IEEGEIFVIM 
GLSGSGKSTM VRLLNRLIEP TRGQVLIDGV DIAKISDAEL REVRRKKIAM VFQSFALMPH
MTVLDNTAFG MELAGINAEE RREKALDALR QVGLENYAHS YPDELSGGMR QRVGLARALA
INPDILLMDE AFSALDPLIR TEMQDELVKL QAKHQRTIVF ISHDLDEAMR IGDRIAIMQN
GEVVQVGTPD EILNNPANDY VRTFFRGVDI SQVFSAKDIA RRTPNGLIRK TPGFGPRSAL
KLLQDEDREY GYVIERGNKF VGAVSIDSLK TALTQQQGLD AALIDAPLAV DAQTPLSELL
SHVGQAPCAV PVVDEDQQYV GIISKGMLLR ALDREGVNNG