Gene B21_03856 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03856 
SymbollysC 
ID8115735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp4137344 
End bp4138693 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content54% 
IMG OID644850012 
Producthypothetical protein 
Protein accessionYP_003001585 
Protein GI251787281 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00656] aspartate kinase, monofunctional class
[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAAA TTGTTGTCTC CAAATTTGGC GGTACCAGCG TAGCTGATTT TGACGCCATG 
AACCGCAGCG CTGATATTGT GCTTTCTGAT GCCAACGTGC GTTTAGTTGT CCTCTCGGCT
TCTGCTGGTA TCACTAATCT GCTGGTCGCT TTAGCTGAAG GACTGGAACC TGGCGAGCGA
TTCGAAAAAC TCGACGCTAT CCGCAACATC CAGTTTGCCA TTCTGGAACG TCTGCGTTAC
CCGAACGTTA TCCGTGAAGA GATTGAACGT CTGCTGGAGA ACATTACTGT TCTGGCAGAA
GCGGCGGCGC TGGCAACGTC TCCGGCGCTG ACAGATGAGC TGGTCAGCCA CGGCGAGCTG
ATGTCGACCC TGCTGTTTGT TGAGATCCTG CGCGAACGCG ATGTTCAGGC ACAGTGGTTT
GATGTACGTA AAGTGATGCG TACCAACGAC CGATTTGGTC GTGCAGAGCC AGATATAGCC
GCGCTGGCGG AACTGGCCGC GCTGCAGCTG CTCCCACGTC TCAATGAAGG CTTAGTGATC
ACCCAGGGAT TTATCGGTAG CGAAAATAAA GGTCGTACAA CGACGCTTGG CCGTGGAGGC
AGCGATTATA CGGCAGCCTT GCTGGCGGAG GCTTTACACG CATCTCGTGT TGATATCTGG
ACCGACGTCC CGGGCATCTA CACCACCGAT CCACGCGTAG TTTCCGCAGC AAAACGCATT
GATGAAATCG CGTTTGCCGA AGCGGCAGAG ATGGCAACTT TTGGTGCAAA AGTACTGCAT
CCGGCAACGT TGCTACCCGC AGTACGCAGC GATATCCCGG TCTTTGTCGG CTCCAGCAAA
GACCCACGCG CAGGTGGTAC GCTGGTGTGC AATAAAACTG AAAATCCGCC GCTGTTCCGC
GCTCTGGCGC TTCGTCGCAA TCAGACTCTG CTCACTTTGC ACAGCCTGAA TATGCTGCAT
TCTCGCGGTT TCCTCGCGGA AGTTTTCGGC ATCCTCGCGC GGCATAATAT TTCGGTAGAC
TTAATCACCA CGTCAGAAGT GAGCGTGGCA TTAACCCTTG ATACCACCGG TTCAACCTCC
ACTGGCGATA CGTTGCTGAC GCAATCTCTG CTGATGGAGC TTTCCGCACT GTGTCGGGTG
GAGGTGGAAG AAGGTCTGGC GCTGGTCGCG TTGATTGGCA ATGACCTGTC AAAAGCCTGC
GGCGTTGGCA AAGAGGTATT CGGCGTACTG GAACCGTTCA ACATTCGCAT GATTTGTTAT
GGCGCATCCA GCCATAACCT GTGCTTCCTG GTGCCCGGCG AAGATGCCGA GCAGGTGGTG
CAAAAACTGC ATAGTAATTT GTTTGAGTAA
 
Protein sequence
MSEIVVSKFG GTSVADFDAM NRSADIVLSD ANVRLVVLSA SAGITNLLVA LAEGLEPGER 
FEKLDAIRNI QFAILERLRY PNVIREEIER LLENITVLAE AAALATSPAL TDELVSHGEL
MSTLLFVEIL RERDVQAQWF DVRKVMRTND RFGRAEPDIA ALAELAALQL LPRLNEGLVI
TQGFIGSENK GRTTTLGRGG SDYTAALLAE ALHASRVDIW TDVPGIYTTD PRVVSAAKRI
DEIAFAEAAE MATFGAKVLH PATLLPAVRS DIPVFVGSSK DPRAGGTLVC NKTENPPLFR
ALALRRNQTL LTLHSLNMLH SRGFLAEVFG ILARHNISVD LITTSEVSVA LTLDTTGSTS
TGDTLLTQSL LMELSALCRV EVEEGLALVA LIGNDLSKAC GVGKEVFGVL EPFNIRMICY
GASSHNLCFL VPGEDAEQVV QKLHSNLFE