Gene B21_03739 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03739 
SymbolrhaB 
ID8113844 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp4001980 
End bp4003449 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content55% 
IMG OID644849899 
Producthypothetical protein 
Protein accessionYP_003001472 
Protein GI251787168 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1070] Sugar (pentulose and hexulose) kinases 
TIGRFAM ID[TIGR02627] rhamnulokinase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCTTTC GCAATTGTGT CGCCGTCGAT CTCGGCGCAT CCAGTGGGCG CGTGATGCTG 
GCGCGTTACG AGCGTGAATG CCGCAGCCTG ACGCTGCGCG AAATCCATCG TTTTAACAAT
GGGCTGCATA GTCAGAACGG CTATGTCACC TGGGATGTGG ATAGCCTGGA AAGTGCCATT
CGCCTTGGAT TAAACAAGGT GTGCGAGGAA GGGATTCGTA TCGATAGCAT TGGGATTGAT
ACCTGGGGCG TGGACTTTGT GCTGCTCGAC CAACAGGGTC AGCGTGTGGG CCTGCCCGTT
GCTTATCGCG ATAGCCGCAC CAATGGCCTA ATGGCGCAGG CACAACAACA ACTCGGCAAA
CGCGATATTT ATCAACGTAG CGGCATCCAG TTTCTGCCCT TCAATACGCT TTATCAGTTG
CGTGCGCTGA CGGAGCAACA ACCTGAACTT ATTCCACACA TTGCTCACGC TCTGCTGATG
CCGGATTACT TCAGTTATCG CCTGACCGGC AAGATGAACT GGGAATATAC CAACGCCACG
ACCACGCAAC TGGTCAATAT CAATAGCGAC GACTGGGACG AGTCGCTACT GGCGTGGAGC
GGGGCCAACA AAGCCTGGTT TGGTCGCCCG ACGCATCCGG GTAATGTCAT AGGTCACTGG
ATTTGCCCGC AGGGTAATGA GATTCCGGTG GTCGCCGTTG CCAGCCATGA TACCGCCAGC
GCGGTTATCG CCTCGCCGTT AAACGGTTCA CGCGCCGCTT ATCTCTCTTC TGGCACCTGG
TCATTGATGG GCTTCGAAAG CCAGACGCCA TTTACCAATG ACACGGCGCT GGCAGCCAAC
ATCACCAATG AAGGCGGGGC GGAAGGTCGC TATCGGGTGC TGAAAAATAT TATGGGCTTA
TGGCTGCTTC AGCGAGTGCT ACAGGAGCGG CAAATCAACG ATCTCCCGGC GCTTATCGCC
GCGACACAGG CACTTCCGGC CTGCCGCTTC ATCATCAATC CCAATGACGA TCGCTTTATT
AACCCTGACG AGATGTGCAG CGAAATTCAG GCTGCGTGTC GGGAAATGGC GCAACCGATC
CCAGAAAGTG ATGCTGAACT GGCGCGCTGT ATTTTCGACA GTCTGGCGTT GCTGTATGCC
GATGTGTTGC ATGAGCTGGC GCAGCTACGC GGTGAAGATT TCTCGCAACT GCATATTGTC
GGCGGCGGCT GCCAGAACAC GCTGCTCAAC CAGCTATGTG CCGATGCCTG CGGTATTCGG
GTGATCGCCG GGCCTGTTGA AGCCTCGACG CTCGGCAATA TCGGCATCCA GTTAATGACG
CTGGATGAAC TCAACAATGT GGATGATTTC CGTCAGGTCG TCAGCACCAC CGCGAATCTG
ACCACCTTTA CCCCTAATCC TGACAGTGAA ATTGCCCACT ATGTGGCGCT GATTCACTCT
ACACGACAGA CAAAGGAGCT TTGCGCATGA
 
Protein sequence
MTFRNCVAVD LGASSGRVML ARYERECRSL TLREIHRFNN GLHSQNGYVT WDVDSLESAI 
RLGLNKVCEE GIRIDSIGID TWGVDFVLLD QQGQRVGLPV AYRDSRTNGL MAQAQQQLGK
RDIYQRSGIQ FLPFNTLYQL RALTEQQPEL IPHIAHALLM PDYFSYRLTG KMNWEYTNAT
TTQLVNINSD DWDESLLAWS GANKAWFGRP THPGNVIGHW ICPQGNEIPV VAVASHDTAS
AVIASPLNGS RAAYLSSGTW SLMGFESQTP FTNDTALAAN ITNEGGAEGR YRVLKNIMGL
WLLQRVLQER QINDLPALIA ATQALPACRF IINPNDDRFI NPDEMCSEIQ AACREMAQPI
PESDAELARC IFDSLALLYA DVLHELAQLR GEDFSQLHIV GGGCQNTLLN QLCADACGIR
VIAGPVEAST LGNIGIQLMT LDELNNVDDF RQVVSTTANL TTFTPNPDSE IAHYVALIHS
TRQTKELCA