Gene B21_02804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02804 
Symbolybl135 
ID8116232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2994124 
End bp2995203 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content52% 
IMG OID644848993 
Producthypothetical protein 
Protein accessionYP_003000566 
Protein GI251786262 
COG category[R] General function prediction only 
COG ID[COG0795] Predicted permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000115691 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGTTT TCAGCCGCTA TTTAATCCGT CATCTCTTTC TCGGTTTTGC CGCCGCCGCA 
GGGCTATTGC TGCCGCTTTT TACCACCTTC AACCTGATTA ACGAACTGGA TGATGTCAGC
CCGGGCGGTT ATCGCTGGAC TCAGGCGGTG CTGGTGGTGC TAATGACCTT ACCGCGCACA
CTGGTCGAAC TTTCGCCATT TATCGCCTTA TTGGGAGGGA TTGTCGGCCT GGGGCAGTTA
TCGAAAAACA GTGAGCTTAC CGCCATTCGC AGCATGGGGT TTTCTATCTT CCGTATTGCA
CTGGTGGCGC TGGTTGCAGG GATATTGTGG ACTGTTTCGT TAGGCGCGAT AGATGAGTGG
GTAGCGTCGC CATTACAGCA ACAGGCGTTG CAAATCAAAT CGACCGCCAC CGCGTTGGGG
GAGGACGATG ACATTACCGG CAATATGCTG TGGGCCAGGC GTGGTAATGA ATTTGTGACG
GTGAAATCGC TGAACGAGCA GGGCCAGCCG GTGGGCGTGG AGATTTTTCA TTATCGCGAC
GATCTCTCGC TCGAATCCTA CATTTATGCA CGCAGTGCCA CCATTGAAGA CGACAAAACG
TGGATCCTGC ATGGTGTGAA TCATAAAAAA TGGCTGAATG GCAAAGAAAC GCTGGAAACA
TTAGATAATC TTGCCTGGCA ATCGGCCTTC ACCAGTATGG ATCTTGAAGA GTTATCAATG
CCGGGCAATA CTTTTTCTGT CCGTCAGCTT AATCATTACA TCCATTATTT GCAGGAAACC
GGGCAACCCA GCAGCGAATA CCGCCTTGCA CTGTGGGAAA AACTGGGGCA ACCCATCCTG
ACCCTGGCGA TGATTTTGCT GGCTGTGCCG TTTACCTTTA GCGCCCCGCG CTCGCCAGGG
ATGGGTAGCC GTCTCGCTGT AGGTGTCATC GTTGGCTTAC TCACCTGGAT CAGCTATCAA
ATCATGGTCA ATCTGGGATT GCTATTTGCG TTGAGCGCAC CTGTTACCGC GCTCGGTTTA
CCGATAGCGT TTGTGCTGGT GGCGTTGAGC CTGGTGTATT GGTATGACAG ACAACATTAA
 
Protein sequence
MNVFSRYLIR HLFLGFAAAA GLLLPLFTTF NLINELDDVS PGGYRWTQAV LVVLMTLPRT 
LVELSPFIAL LGGIVGLGQL SKNSELTAIR SMGFSIFRIA LVALVAGILW TVSLGAIDEW
VASPLQQQAL QIKSTATALG EDDDITGNML WARRGNEFVT VKSLNEQGQP VGVEIFHYRD
DLSLESYIYA RSATIEDDKT WILHGVNHKK WLNGKETLET LDNLAWQSAF TSMDLEELSM
PGNTFSVRQL NHYIHYLQET GQPSSEYRLA LWEKLGQPIL TLAMILLAVP FTFSAPRSPG
MGSRLAVGVI VGLLTWISYQ IMVNLGLLFA LSAPVTALGL PIAFVLVALS LVYWYDRQH