Gene ECD_02090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_02090 
SymbolyeiJ 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp2147743 
End bp2148993 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content51% 
IMG OID 
Productpredicted nucleoside transporter 
Protein accessionACT43913 
Protein GI253978243 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGTCA TGAGAAGTGT TCTGGGAATG GTGGTATTGC TGACGATTGC GTTTTTGCTG 
TCAGTAAACA AGAAGAAGAT CAGCCTGCGT ACCGTTGGCG CGGCGTTAGT GTTACAGGTC
GTGATTGGCG GGATTATGCT TTGGTTACCG CCAGGGCGTT GGGTCGCTGA AAAAGTCGCT
TTTGGCGTGC ATAAAGTGAT GGCGTACAGC GACGCGGGTA GCGCATTTAT CTTCGGTTCG
CTGGTCGGGC CGAAAATGGA CACGCTGTTT GATGGCGCAG GATTTATCTT TGGTTTCAGG
GTATTACCGG CAATTATCTT CGTCACTGCA CTGGTGAGTA TTCTCTACTA CATCGGTGTG
ATGGGGATTT TAATTCGCAT TCTCGGCGGT ATATTCCAGA AAGCATTAAA TATCAGCAAG
ATTGAGTCAT TCGTCGCGGT CACCACCATT TTCCTCGGGC AAAACGAAAT TCCGGCGATC
GTCAAACCCT TTATCGATCG TCTGAATCGC AATGAATTAT TTACAGCGAT TTGTAGTGGC
ATGGCCTCGA TTGCTGGTTC GACAATGATT GGTTACGCCG CCCTGGGCGT ACCTGTGGAA
TATTTGCTGG CGGCATCGTT AATGGCGATC CCAGGCGGGA TCTTGTTTGC CCGCCTGTTA
AGCCCGGCTA CGGAATCTTC GCAGGTTTCC TTTAATAACC TCTCTTTCAC CGAAACACCG
CCAAAAAGCA TTATTGAAGC CGCTGCGACA GGGGCAATGA CCGGGCTGAA AATCGCCGCC
GGTGAAGCGA CAGTTGTTAT GGCATTTGTC GCCATCATTG CGTTAATTAA TGGTATTATC
GGCGGCGTTG GCGGCTGGTT TGGTTTTGCA CATGCCTCGC TGGAGTCCAT TTTAGGTTAC
CTGTTGGCCC CATTGGCGTG GGTGATGGGG GGTGACTGGA GTGATGCAAA TCTTGCCGGG
AGTTTGATTG GGCAGAAGCT GGCGATCAAT GAATTTGTCG CTTATCTCAA TTTCTCGCCA
TATCTGCAAA CGGGTGGCAC TCTGGATGCT AAAACCGTGG CGATTATTTC TTTCGCGTTG
TGCGGTTTCG CTAACTTTGG TTCTATCGGG GTGGTGGTGG GGGCGTTTTC TGCGGTTGCG
CCACACCGTG CGCCGGAAAT CGCCCAACTT GGTTTACGCG CGCTGGCGGC GGCGACACTT
TCTAACCTGA TGAGTGCTAC TATTGCAGGA TTCTTTATTG GTTTAGCGTA G
 
Protein sequence
MDVMRSVLGM VVLLTIAFLL SVNKKKISLR TVGAALVLQV VIGGIMLWLP PGRWVAEKVA 
FGVHKVMAYS DAGSAFIFGS LVGPKMDTLF DGAGFIFGFR VLPAIIFVTA LVSILYYIGV
MGILIRILGG IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDRLNR NELFTAICSG
MASIAGSTMI GYAALGVPVE YLLAASLMAI PGGILFARLL SPATESSQVS FNNLSFTETP
PKSIIEAAAT GAMTGLKIAA GEATVVMAFV AIIALINGII GGVGGWFGFA HASLESILGY
LLAPLAWVMG GDWSDANLAG SLIGQKLAIN EFVAYLNFSP YLQTGGTLDA KTVAIISFAL
CGFANFGSIG VVVGAFSAVA PHRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA