Gene B21_04033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_04033 
Symbolybl210 
ID8113916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp4331272 
End bp4332549 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content47% 
IMG OID644850183 
Producthypothetical protein 
Protein accessionYP_003001756 
Protein GI251787452 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA TAAAGAATTA CCGGTGGCAT ATGATTGCCC TCGTATGCTT TATCACTGTA 
ATCAATTATC TGGACAGAAC GGCATTAGGT ATTGCGGCTC CAACGATTAT GGAGACAACC
GGAATAACTA AAGAGCAATA TTCATGGATT GTCAGTGCAT TCCAGTTGGC CTATACATTA
GGGCAACCGG TAATGGGCTT CTTTATTGAT ACCGTGGGTC TGAAGTTAAG TTTTGCGATA
TGTGCCGCAA TTTGGGGCCT GGCGACAATG GGCCATGCAC TCACCGGAAC GTGGTCTGGT
CTGGCATTTA TGCGCGCCCT GATGGGTTTC AGCGAAGCGT CGGCAATTCC GGCGGGTGTA
AAAACCGCAT CAACATGGTT CCCGGCAAAA GAGCGTGGCG TGGCGACAGG TGTTTTCAAT
ATGGGCACCT CACTCGGCGC GATGCTTGCT CCACCGTTGA TTGCCTGGTG CATTATGTTT
CATAGCTGGC AATTTGCGTT TATTGTTTCA GGTAGCCTTG CTTTGCTCGC GGCTTTATTT
TGGTTCTTTT GTTATAAAGA TCCGAAAGAT GCCAAACGCC TTTCTGATGA AGAGCGCCAC
TATATTGAAT CAGGACAAGA ACAGCATCTT AAAACAGATA AGAAAGAAAA AACGTCAATC
AAGCATATCC TCAGCCAACG TAATTTCTGG GGGATTGGCA TCGCGCGTTT TCTCGCAGAC
CCGGCATGGG GAACCATTAA CTTCTGGGTG CCGATTTTCT TCGTCGAAAC GCTGCATTTT
AGCCTGAAAG AAATTGCCAT GTTCGTCTGG CTGCCTTTCC TGCTGGGCGA TCTCGGCTGT
TTAGCCAGTG GTTTTGTCGC GAAGTTCTTC CACGATCGCG GCGTGAGTTT AATTAACTCA
CGAAGAATTA CCTTCACTAT TGCAGCCGTC ATTATGATGA CGATTGGCCT GGTGAGTATT
GTCGAAAATC CCTACATTGC CGTATTACTG ATTAGTATTG GCGCGTTCTC GCATCAATGT
CTTTCTACTG TAGCAGCAAC TCTGGGTGGC GATCTGTTCA AAAAAGACGA AGTAGCTACC
GCAGTGGGTA TGGCAGGAGC CTGTGCGTGG AGCGGTCAGT TGATTTTCAA CCTGTTTATC
GGGGCATTCG TTCACATTAT CGGCTTCGCG CCGTTCTTTA TTGCCCTGGC TTTCTTTGAC
ATTATTGGCG CCATTGCGCT GTGGACGCTT ATCAAAGTTA AAGATGAAGA ACCGCAAGTA
CAGTTAGCGA CAAGCTAA
 
Protein sequence
MNKIKNYRWH MIALVCFITV INYLDRTALG IAAPTIMETT GITKEQYSWI VSAFQLAYTL 
GQPVMGFFID TVGLKLSFAI CAAIWGLATM GHALTGTWSG LAFMRALMGF SEASAIPAGV
KTASTWFPAK ERGVATGVFN MGTSLGAMLA PPLIAWCIMF HSWQFAFIVS GSLALLAALF
WFFCYKDPKD AKRLSDEERH YIESGQEQHL KTDKKEKTSI KHILSQRNFW GIGIARFLAD
PAWGTINFWV PIFFVETLHF SLKEIAMFVW LPFLLGDLGC LASGFVAKFF HDRGVSLINS
RRITFTIAAV IMMTIGLVSI VENPYIAVLL ISIGAFSHQC LSTVAATLGG DLFKKDEVAT
AVGMAGACAW SGQLIFNLFI GAFVHIIGFA PFFIALAFFD IIGAIALWTL IKVKDEEPQV
QLATS