Gene ECD_02093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02093 
SymbolyeiM 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2150933 
End bp2152183 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content50% 
IMG OID 
Productpredicted nucleoside transporter 
Protein accessionACT43916 
Protein GI253978246 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACATAA TGAGAAGTGT TGTGGGGATG GTGGTGTTAC TGGCAATAGC ATTTCTGTTG 
TCAGTGAATA AAAAGAGCAT CAGTTTGCGT ACGGTTGGAG CCGCACTGCT GCTGCAAATC
GCTATTGGTG GCATCATGCT CTACTTCCCA CCGGGAAAAT GGGCAGTAGA ACAGGCGGCA
TTAGGCGTTC ATAAAGTGAT GTCTTACAGT GATGCCGGTA GCGCCTTCAT TTTTGGTTCG
CTGGTTGGGC CGAAAATGGA TGTCCTGTTT GACGGTGCGG GTTTTATCTT CGCCTTTCGC
GTACTTCCGG CGATTATTTT CGTTACTGCG CTCATCAGTC TGCTGTACTA CATTGGCGTG
ATGGGGCTGC TGATTCGCAT CCTTGGCAGC ATTTTCCAGA AAGCTCTTAA CATCAGCAAA
ATCGAATCTT TTGTCGCAGT CACAACTATT TTCCTCAGGC AAAATGAGAT CCCGGCGATC
GTTAAACCGT TTATCGATCG CATGAATCGC AACGAGTTGT TTACCGCAAT TTGTAGCGGG
ATGGCGTCCA TTGCTGGTTC GATGATGATT GGTTATGCCG GAATGGGCGT ACCAATTGAC
TACCTGTTAG CGGCATCGCT GATGGCGATC CCAGGTGGTA TTTTGTTTGC ACGTATTCTT
AGCCCGGCCA CCGAGCCTTC GCAGGTCACA TTTGAAAATC TGTCGTTCAG CGAAACGCCG
CCAAAAAGCT TTATCGAAGC GGCGGCGAGC GGTGCGATGA CTGGGCTGAA AATCGCCGCT
GGTGTAGCGA CGGTGGTAAT GGCGTTTGTC GCAATTATTG CGCTGATCAA CGGCATTATC
GGCGGAATTG GTGGCTGGTT TGGTTTCGCC AATGCCTCTC TGGAAAGTAT TTTTGGCTAT
GTGCTGGCAC CGCTGGCGTG GATCATGGGT GTGGACTGGA GTGATGCCAA TCTTGCGGGT
AGCCTGATTG GGCAGAAACT GGCGATTAAC GAATTCGTCG CTTACCTGAG TTTCTCCCCA
TACCTGCAAA CGGGCGGCAC GCTGGAAGTG AAAACCATTG CGATTATCTC CTTTGCGCTT
TGTGGTTTTG CTAACTTTGG TTCTATCGGT GTTGTCGTTG GCGCATTTTC GGCTATTTCG
CCAAAACGTG CTCCAGAAAT CGCCCAGCTT GGTTTACGGG CGCTGGCAGC AGCAACGCTT
TCCAACCTGA TGAGTGCGAC TATTGCCGGG TTCTTTATTG GTCTGGCGTA A
 
Protein sequence
MDIMRSVVGM VVLLAIAFLL SVNKKSISLR TVGAALLLQI AIGGIMLYFP PGKWAVEQAA 
LGVHKVMSYS DAGSAFIFGS LVGPKMDVLF DGAGFIFAFR VLPAIIFVTA LISLLYYIGV
MGLLIRILGS IFQKALNISK IESFVAVTTI FLRQNEIPAI VKPFIDRMNR NELFTAICSG
MASIAGSMMI GYAGMGVPID YLLAASLMAI PGGILFARIL SPATEPSQVT FENLSFSETP
PKSFIEAAAS GAMTGLKIAA GVATVVMAFV AIIALINGII GGIGGWFGFA NASLESIFGY
VLAPLAWIMG VDWSDANLAG SLIGQKLAIN EFVAYLSFSP YLQTGGTLEV KTIAIISFAL
CGFANFGSIG VVVGAFSAIS PKRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA