Gene B21_04153 details


Gene Information

Locus tag: B21_04153
Symbol: uxuA
ID: 8115674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 4459005
End bp: 4460189
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 54%
IMG OID: 644850297
Product: hypothetical protein
Protein accession: YP_003001870
Protein GI: 251787566
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1312] D-mannonate dehydratase
TIGRFAM ID: [TIGR00695] mannonate dehydratase
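
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. Neither assumption is stated in the record, but both are consistent with the reported values.

```python
# Values copied from the record above.
start_bp = 4459005
end_bp = 4460189
reported_gene_length = 1185     # bp
reported_protein_length = 394   # aa

# Assumption: start and end are 1-based, inclusive positions.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1185 0 394
```

Under these assumptions the computed lengths agree with the Gene Length and Protein Length fields.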


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAACAGA CCTGGCGCTG GTACGGCCCA AACGATCCGG TTTCTTTAGC TGATGTCCGT 
CAGGCGGGCG CAACTGGCGT GGTTACCGCG CTGCACCATA TCCCGAACGG CGAAGTATGG
TCCGTTGAAG AGATCCTCAA ACGCAAGGCG ATCATTGAAG ACGCAGGCCT GGTGTGGTCT
GTCGTAGAAA GCGTGCCAAT TCACGAAGAT ATCAAAACCC ACACTGGCAA CTATGAGCAG
TGGATCGCTA ACTATCAGCA GACTCTGCGC AACCTGGCGC AGTGTGGCAT TCGCACCGTG
TGCTACAACT TCATGCCGGT GCTCGACTGG ACCCGTACTG ACCTCGAATA CGTGCTGCCA
GACGGCTCCA AAGCTCTGCG CTTCGACCAG ATCGAATTCG CTGCATTCGA AATGCATATC
CTGAAACGCC CAGGCGCGGA AGCGGATTAC ACCGAAGAAG AAATTGCTCA GGCAGCTGAA
CGCTTCGCCA CTATGAGCGA TGAAGACAAA GCGCGTCTGA CCCGTAACAT CATTGCTGGT
CTTCCGGGCG CGGAAGAAGG CTACACCCTC GACCAGTTCC GTAAACACCT GGAGCTGTAC
AAAGATATCG ACAAAGCGAA GCTGCGCGAA AACTTTGCCG TCTTCCTGAA AGCGATTATT
CCAGTTGCTG AAGAAGTCGG CGTGCGTATG GCTGTTCACC CGGACGATCC GCCGCGCCCG
ATCCTCGGCC TGCCGCGCAT TGTTTCCACC ATTGAAGATA TGCAGTGGAT GGTTGATACC
GTAAACAGCA TGGCAAACGG TTTTACCATG TGCACCGGTT CCTACGGCGT GCGTGCTGAC
AACGATCTGG TTGATATGAT CAAGCAGTTC GGTCCGCGTA TTTACTTCAC CCATCTGCGC
TCCACCATGC GTGAAGATAA CCCGAAAACC TTCCACGAAG CGGCGCACCT GAACGGTGAC
GTTGATATGT ACGAAGTGGT GAAAGCGATT GTTGAAGAAG AACACCGTCG TAAAGCGGAA
GGCAAAGAAG ACCTGATCCC GATGCGTCCG GACCACGGTC ATCAGATGCT GGACGACCTG
AAGAAGAAAA CCAACCCAGG TTACTCCGCA ATTGGTCGTC TGAAAGGCCT GGCCGAAGTT
CGCGGTGTCG AACTGGCGAT CCAGCGCGCT TTCTTTAGCC GTTAA
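
The GC content field of the record is a percentage of G and C bases in the gene sequence. A minimal sketch of that calculation follows. The function name `gc_content` is hypothetical, and the example runs on the first 60 bases only, so its result need not match the 54% reported for the full gene.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100 * gc / len(bases)


# First line of the gene sequence above (60 bases), spaces included as shown.
first_line = "ATGGAACAGA CCTGGCGCTG GTACGGCCCA AACGATCCGG TTTCTTTAGC TGATGTCCGT"
print(f"{gc_content(first_line):.1f}%")
```

Passing the full 1185 bp sequence as one string (whitespace and line breaks are stripped) gives the value to compare with the record.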
 
Protein sequence
MEQTWRWYGP NDPVSLADVR QAGATGVVTA LHHIPNGEVW SVEEILKRKA IIEDAGLVWS 
VVESVPIHED IKTHTGNYEQ WIANYQQTLR NLAQCGIRTV CYNFMPVLDW TRTDLEYVLP
DGSKALRFDQ IEFAAFEMHI LKRPGAEADY TEEEIAQAAE RFATMSDEDK ARLTRNIIAG
LPGAEEGYTL DQFRKHLELY KDIDKAKLRE NFAVFLKAII PVAEEVGVRM AVHPDDPPRP
ILGLPRIVST IEDMQWMVDT VNSMANGFTM CTGSYGVRAD NDLVDMIKQF GPRIYFTHLR
STMREDNPKT FHEAAHLNGD VDMYEVVKAI VEEEHRRKAE GKEDLIPMRP DHGHQMLDDL
KKKTNPGYSA IGRLKGLAEV RGVELAIQRA FFSR
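
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is illustrative and not the method used to produce this record. It builds the codon table from the amino acid string that NCBI lists for translation table 11, which assigns the same amino acids as the standard code and differs only in the permitted start codons. The names `CODON_TABLE` and `translate` are hypothetical. Only the first and last lines of the gene sequence are translated here. The last line starts in frame because the 1140 bases before it are a multiple of three.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGGAACAGA CCTGGCGCTG GTACGGCCCA AACGATCCGG TTTCTTTAGC TGATGTCCGT"
last_line = "CGCGGTGTCG AACTGGCGAT CCAGCGCGCT TTCTTTAGCC GTTAA"

print(translate(first_line))  # MEQTWRWYGPNDPVSLADVR
print(translate(last_line))   # RGVELAIQRAFFSR
```

Both results match the corresponding start and end of the protein sequence listed above.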