Gene B21_02049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02049 
SymbolyeiJ 
ID8114690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2147055 
End bp2148305 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content51% 
IMG OID644848259 
Producthypothetical protein 
Protein accessionYP_002999832 
Protein GI251785528 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1972] Nucleoside permease 
TIGRFAM ID[TIGR00804] nucleoside transporter 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGTCA TGAGAAGTGT TCTGGGAATG GTGGTATTGC TGACGATTGC GTTTTTGCTG 
TCAGTAAACA AGAAGAAGAT CAGCCTGCGT ACCGTTGGCG CGGCGTTAGT GTTACAGGTC
GTGATTGGCG GGATTATGCT TTGGTTACCG CCAGGGCGTT GGGTCGCTGA AAAAGTCGCT
TTTGGCGTGC ATAAAGTGAT GGCGTACAGC GACGCGGGTA GCGCATTTAT CTTCGGTTCG
CTGGTCGGGC CGAAAATGGA CACGCTGTTT GATGGCGCAG GATTTATCTT TGGTTTCAGG
GTATTACCGG CAATTATCTT CGTCACTGCA CTGGTGAGTA TTCTCTACTA CATCGGTGTG
ATGGGGATTT TAATTCGCAT TCTCGGCGGT ATATTCCAGA AAGCATTAAA TATCAGCAAG
ATTGAGTCAT TCGTCGCGGT CACCACCATT TTCCTCGGGC AAAACGAAAT TCCGGCGATC
GTCAAACCCT TTATCGATCG TCTGAATCGC AATGAATTAT TTACAGCGAT TTGTAGTGGC
ATGGCCTCGA TTGCTGGTTC GACAATGATT GGTTACGCCG CCCTGGGCGT ACCTGTGGAA
TATTTGCTGG CGGCATCGTT AATGGCGATC CCAGGCGGGA TCTTGTTTGC CCGCCTGTTA
AGCCCGGCTA CGGAATCTTC GCAGGTTTCC TTTAATAACC TCTCTTTCAC CGAAACACCG
CCAAAAAGCA TTATTGAAGC CGCTGCGACA GGGGCAATGA CCGGGCTGAA AATCGCCGCC
GGTGAAGCGA CAGTTGTTAT GGCATTTGTC GCCATCATTG CGTTAATTAA TGGTATTATC
GGCGGCGTTG GCGGCTGGTT TGGTTTTGCA CATGCCTCGC TGGAGTCCAT TTTAGGTTAC
CTGTTGGCCC CATTGGCGTG GGTGATGGGG GGTGACTGGA GTGATGCAAA TCTTGCCGGG
AGTTTGATTG GGCAGAAGCT GGCGATCAAT GAATTTGTCG CTTATCTCAA TTTCTCGCCA
TATCTGCAAA CGGGTGGCAC TCTGGATGCT AAAACCGTGG CGATTATTTC TTTCGCGTTG
TGCGGTTTCG CTAACTTTGG TTCTATCGGG GTGGTGGTGG GGGCGTTTTC TGCGGTTGCG
CCACACCGTG CGCCGGAAAT CGCCCAACTT GGTTTACGCG CGCTGGCGGC GGCGACACTT
TCTAACCTGA TGAGTGCTAC TATTGCAGGA TTCTTTATTG GTTTAGCGTA G
 
Protein sequence
MDVMRSVLGM VVLLTIAFLL SVNKKKISLR TVGAALVLQV VIGGIMLWLP PGRWVAEKVA 
FGVHKVMAYS DAGSAFIFGS LVGPKMDTLF DGAGFIFGFR VLPAIIFVTA LVSILYYIGV
MGILIRILGG IFQKALNISK IESFVAVTTI FLGQNEIPAI VKPFIDRLNR NELFTAICSG
MASIAGSTMI GYAALGVPVE YLLAASLMAI PGGILFARLL SPATESSQVS FNNLSFTETP
PKSIIEAAAT GAMTGLKIAA GEATVVMAFV AIIALINGII GGVGGWFGFA HASLESILGY
LLAPLAWVMG GDWSDANLAG SLIGQKLAIN EFVAYLNFSP YLQTGGTLDA KTVAIISFAL
CGFANFGSIG VVVGAFSAVA PHRAPEIAQL GLRALAAATL SNLMSATIAG FFIGLA