Gene B21_01101 details


Gene Information

Locus tag: B21_01101
Symbol: yceG
ID: 8114292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 1156088
End bp: 1157110
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 51%
IMG OID: 644847358
Product: hypothetical protein
Protein accession: YP_002998931
Protein GI: 251784627
COG category: [R] General function prediction only
COG ID: [COG1559] Predicted periplasmic solute-binding protein
TIGRFAM ID: [TIGR00247] conserved hypothetical protein, YceG family
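
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so, but the numbers are consistent with it) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and only serve the illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acids encoded by a complete CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1156088, 1157110)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1023 340
```

Both results agree with the Gene Length and Protein Length fields.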


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000104993
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAG TGTTATTGAT AATCTTGTTA TTGCTGGTGG TACTGGGTAT CGCCGCTGGT 
GTGGGCGTCT GGAAGGTTCG CCATCTTGCC GACAGCAAAT TGCTTATCAA AGAAGAGACG
ATATTTACCC TGAAGCCAGG GACCGGACGT CTGGCGCTCG GTGAACAGCT TTATGCTGAT
AAGATCATCA ATCGCCCACG GGTTTTTCAA TGGCTGCTGC GTATCGAACC GGATCTTTCT
CACTTTAAAG CCGGGACTTA CCGCTTTACA CCGCAGATGA CCGTGCGCGA GATGCTGAAA
TTGTTGGAAA GCGGTAAAGA AGCACAGTTC CCGCTGCGAC TGGTAGAAGG GATGCGTCTG
AGCGACTACC TCAAGCAATT GCGTGAGGCT CCGTATATCA AGCACACGCT GAGTGACGAT
AAGTACGCCA CCGTAGCGCA GGCACTTGAA CTGGAAAACC CGGAGTGGAT TGAAGGTTGG
TTCTGGCCAG ACACCTGGAT GTATACCGCC AATACCACCG ATGTCGCGTT ACTCAAGCGA
GCGCACAAGA AAATGGTGAA AGCGGTCGAT AGCGCCTGGG AAGGGCGTGC GGACGGTCTG
CCTTATAAAG ATAAAAATCA GCTGGTGACG ATGGCATCAA TTATCGAAAA AGAAACCGCC
GTTGCCAGTG AACGCGATCA GGTTGCCTCG GTATTTATCA ACCGTTTACG CATTGGTATG
CGCTTGCAGA CCGACCCAAC CGTGATTTAC GGGATGGGAG AGCGTTATAA TGGCAAACTT
TCTCGTGCAG ACCTGGAAAC GCCGACAGCG TATAACACCT ATACCATTAC CGGTTTGCCG
CCGGGTGCGA TAGCTACGCC GGGGGCGGAT TCGCTGAAGG CTGCTGCGCA TCCGGCAAAA
ACGCCGTATC TCTATTTTGT GGCCGATGGT AAAGGTGGTC ACACGTTTAA TACCAATCTT
GCCAGTCATA ACAAGTCTGT GCAGGATTAT CTGAAAGTGC TTAAGGAAAA AAATGCGCAG
TAA
 
Protein sequence
MKKVLLIILL LLVVLGIAAG VGVWKVRHLA DSKLLIKEET IFTLKPGTGR LALGEQLYAD 
KIINRPRVFQ WLLRIEPDLS HFKAGTYRFT PQMTVREMLK LLESGKEAQF PLRLVEGMRL
SDYLKQLREA PYIKHTLSDD KYATVAQALE LENPEWIEGW FWPDTWMYTA NTTDVALLKR
AHKKMVKAVD SAWEGRADGL PYKDKNQLVT MASIIEKETA VASERDQVAS VFINRLRIGM
RLQTDPTVIY GMGERYNGKL SRADLETPTA YNTYTITGLP PGAIATPGAD SLKAAAHPAK
TPYLYFVADG KGGHTFNTNL ASHNKSVQDY LKVLKEKNAQ
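
The sequences above are printed in blocks of ten characters, so the spacing has to be removed before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate the nucleotides. It is an illustration under stated assumptions, not the method used to produce this record: the helper names are hypothetical, and the codon table is written out by hand in the usual NCBI layout (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons). Only the first line of the gene sequence is used to keep the example short.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a cleaned DNA sequence codon by codon, stopping at a stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = clean(
    "ATGAAAAAAG TGTTATTGAT AATCTTGTTA TTGCTGGTGG TACTGGGTAT CGCCGCTGGT"
)
print(translate(first_line))  # prints: MKKVLLIILLLLVVLGIAAG
```

The translated residues match the first twenty residues of the protein sequence above. Passing the full cleaned gene sequence to `gc_fraction` gives a value to compare with the GC content field.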