Gene B21_02244 details


Gene Information

Locus tag: B21_02244
Symbol: yfdU
ID: 8114120
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 2367454
End bp: 2369148
Gene Length: 1695 bp
Protein Length: 564 aa
Translation table: 11
GC content: 47%
IMG OID: 644848449
Product: hypothetical protein
Protein accession: YP_003000022
Protein GI: 251785718
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID: [TIGR03254] oxalyl-CoA decarboxylase
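
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the record; coordinates are ASSUMED to be 1-based, inclusive.
START_BP = 2367454
END_BP = 2369148


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS, ASSUMING one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1695, matching "Gene Length"
print(protein_length(length_bp))  # 564, matching "Protein Length"
```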


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGATC AACTTCAAAT GACAGATGGT ATGCATATCA TCGTTGAAGC ATTAAAACAG 
AATAATATTG ACACTATTTA TGGTGTTGTA GGTATTCCTG TGACGGATAT GGCACGCCAT
GCCCAGGCGG AAGGCATTCG TTATATTGGT TTTCGTCATG AGCAGTCGGC AGGCTATGCC
GCTGCGGCAA GCGGTTTTCT TACCCAAAAA CCGGGGATCT GCCTGACAGT TTCTGCGCCA
GGATTCCTCA ATGGTTTGAC CGCATTGGCC AACGCAACGG TAAATGGTTT TCCGATGATC
ATGATTAGCG GCTCCAGCGA CCGCGCGATC GTCGACCTAC AGCAAGGTGA TTATGAAGAG
CTGGACCAAA TGAATGCGGC AAAACCGTAT GCCAAAGCAG CATTTCGCGT TAATCAGCCG
CAGGATCTTG GCATTGCATT GGCACGCGCT ATCCGGGTCT CTGTATCGGG TCGCCCTGGC
GGAGTTTATC TTGATTTGCC AGCAAATGTC CTGGCCGCGA CGATGGAAAA AGACGAAGCG
TTAACCACGA TTGTTAAAGT TGAAAATCCG TCGCCAGCAT TATTGCCATG CCCGAAGTCA
GTCACTAGCG CAATTTCGCT TTTAGCAAAA GCTGAACGGC CATTAATTAT CCTTGGCAAA
GGCGCGGCGT ATTCACAAGC TGATGAACAG CTTCGTGAAT TTATTGAAAG TGCTCAGATT
CCATTCCTGC CAATGTCTAT GGCGAAAGGG ATCCTTGAAG ATACGCATCC ACTTTCTGCG
GCAGCTGCGC GTTCGTTTGC CCTGGCAAAT GCTGACGTTG TCATGCTTGT TGGTGCACGA
CTGAATTGGT TATTGGCACA CGGTAAAAAA GGATGGGCGG CAGATACACA GTTTATTCAA
CTGGATATTG AACCGCAGGA AATTGACAGC AACCGCCCCA TTGCTGTGCC AGTCGTTGGT
GATATTGCAT CCAGTATGCA AGGTATGCTG GCAGAACTGA AACAAAACAC ATTTACGACT
CCACTGGTAT GGCGCGATAT TTTAAATATC CACAAGCAGC AAAATGCACA AAAAATGCAT
GAAAAATTAA GTACAGATAC TCAACCATTA AATTACTTTA ATGCATTAAG TGCTGTGCGC
GACGTATTGC GCGAGAACCA GGATATTTAT TTAGTTAATG AAGGTGCAAA TACCCTGGAT
AATGCACGAA ATATTATTGA TATGTATAAA CCACGTCGTC GTCTGGATTG TGGTACCTGG
GGTGTCATGG GCATCGGTAT GGGCTATGCC ATCGGTGCTA GCGTGACTTC TGGTTCTCCG
GTTGTCGCCA TTGAAGGTGA TAGTGCTTTT GGTTTCAGTG GGATGGAAAT TGAAACGATT
TGTCGATATA ACCTGCCGGT GACGATCGTT ATTTTTAATA ATGGCGGCAT CTACAGAGGA
GACGGTGTTG ATCTCAGTGG CGCTGGTGCA CCATCACCAA CGGATCTGTT GCACCATGCA
AGGTATGACA AATTAATGGA TGCGTTTCGT GGCGTTGGCT ATAACGTCAC CACGACAGAT
GAACTTCGTC ATGCTTTAAC CACCGGTATT CAGTCGCGCA AACCGACCAT TATTAATGTG
GTCATCGACC CTGCAGCAGG AACTGAAAGT GGCCATATTA CCAAACTTAA CCCAAAACAA
GTCGCTGGTA ATTAA
 
Protein sequence
MSDQLQMTDG MHIIVEALKQ NNIDTIYGVV GIPVTDMARH AQAEGIRYIG FRHEQSAGYA 
AAASGFLTQK PGICLTVSAP GFLNGLTALA NATVNGFPMI MISGSSDRAI VDLQQGDYEE
LDQMNAAKPY AKAAFRVNQP QDLGIALARA IRVSVSGRPG GVYLDLPANV LAATMEKDEA
LTTIVKVENP SPALLPCPKS VTSAISLLAK AERPLIILGK GAAYSQADEQ LREFIESAQI
PFLPMSMAKG ILEDTHPLSA AAARSFALAN ADVVMLVGAR LNWLLAHGKK GWAADTQFIQ
LDIEPQEIDS NRPIAVPVVG DIASSMQGML AELKQNTFTT PLVWRDILNI HKQQNAQKMH
EKLSTDTQPL NYFNALSAVR DVLRENQDIY LVNEGANTLD NARNIIDMYK PRRRLDCGTW
GVMGIGMGYA IGASVTSGSP VVAIEGDSAF GFSGMEIETI CRYNLPVTIV IFNNGGIYRG
DGVDLSGAGA PSPTDLLHHA RYDKLMDAFR GVGYNVTTTD ELRHALTTGI QSRKPTIINV
VIDPAAGTES GHITKLNPKQ VAGN
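
The gene and protein sequences above are printed in space-separated blocks of 10, so they need cleaning before any computation. The sketch below shows one way to strip that layout, compute GC content, and translate the coding sequence. It is an illustration, not the method used to produce this record. The helper names are hypothetical, and the codon table is the standard one, whose amino acid assignments are the same as those of translation table 11 (the two tables differ only in which codons may act as start codons). Only the first and last rows of the gene sequence are used as input here, and the example assumes the sequence is given in the coding orientation, as the leading ATG suggests.

```python
# Codon table built from the usual TCAG ordering; amino acid assignments
# are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record.
first_row = clean_sequence(
    "ATGTCAGATC AACTTCAAAT GACAGATGGT ATGCATATCA TCGTTGAAGC ATTAAAACAG"
)
last_row = clean_sequence("GTCGCTGGTA ATTAA")

print(translate(first_row[:30]))  # MSDQLQMTDG, the first block of the protein
print(translate(last_row))        # VAGN, the last residues before the stop codon
print(round(gc_percent(first_row), 1))
```

Running the same functions over the full 1695 bp sequence should reproduce the 564-residue protein; the GC figure may differ slightly from the rounded value in the record depending on how the database rounds.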