Gene B21_03971 details

Gene Information

Locus tag: B21_03971
Symbol: aspA
ID: 8113426
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 4272067
End bp: 4273503
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 49%
IMG OID: 644850123
Product: hypothetical protein
Protein accession: YP_003001696
Protein GI: 251787392
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1027] Aspartate ammonia-lyase
TIGRFAM ID: [TIGR00839] aspartate ammonia-lyase


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAACA ACATTCGTAT CGAAGAAGAT CTGTTGGGTA CCAGGGAAGT TCCAGCTGAT 
GCCTACTATG GTGTTCACAC TCTGAGAGCG ATTGAAAACT TCTATATCAG CAACAACAAA
ATCAGTGATA TTCCTGAATT TGTTCGCGGT ATGGTAATGG TTAAAAAAGC CGCAGCTATG
GCAAACAAAG AGCTGCAAAC CATTCCTAAA AGTGTAGCGA ATGCCATCAT TGCCGCATGT
GATGAAGTCC TGAACAACGG AAAATGCATG GATCAGTTCC CGGTAGACGT CTACCAGGGC
GGCGCAGGTA CTTCCGTAAA CATGAACACC AACGAAGTGC TGGCCAATAT CGGTCTGGAA
CTGATGGGTC ACCAGAAAGG TGAATATCAG TACCTGAACC CGAACGACCA TGTTAACAAA
TGTCAGTCCA CTAACGACGC CTACCCGACC GGTTTCCGTA TCGCAGTTTA CTCTTCTCTG
ATTAAGCTGG TAGATGCGAT TAACCAACTG CGTGAAGGCT TTGAACGTAA AGCTGTCGAA
TTCCAGGACA TCCTGAAAAT GGGTCGTACC CAGCTGCAGG ACGCAGTACC GATGACCCTC
GGTCAGGAAT TCCGCGCTTT CAGCATCCTG CTGAAAGAAG AAGTGAAAAA CATCCAACGT
ACCGCTGAAC TGCTGCTGGA AGTTAACCTT GGCGCAACAG CAATCGGTAC TGGTCTGAAC
ACGCCGAAAG AGTACTCTCC GCTGGCAGTG AAAAAACTGG CTGAAGTCAC TGGCTTCCCA
TGCGTACCGG CTGAAGACCT GATCGAAGCG ACCTCTGACT GCGGCGCTTA TGTTATGGTT
CACGGCGCGC TGAAACGCCT GGCTGTGAAG ATGTCCAAAA TCTGTAACGA CCTGCGCTTG
CTCTCTTCTG GCCCACGTGC CGGCCTGAAC GAGATCAACC TGCCGGAACT GCAGGCGGGC
TCTTCCATCA TGCCAGCTAA AGTAAACCCG GTTGTTCCGG AAGTGGTTAA CCAGGTATGC
TTCAAAGTCA TCGGTAACGA CACCACTGTT ACCATGGCAG CAGAAGCAGG TCAGCTGCAG
TTGAACGTTA TGGAGCCGGT CATTGGCCAG GCTATGTTCG AATCCGTTCA CATTCTGACC
AACGCTTGCT ACAACCTGCT GGAAAAATGC ATTAACGGCA TCACTGCTAA CAAAGAAGTG
TGCGAAGGTT ACGTTTACAA CTCTATCGGT ATCGTTACTT ACCTGAACCC GTTCATCGGT
CACCACAACG GTGACATCGT GGGTAAAATC TGTGCCGAAA CCGGTAAGAG TGTACGTGAA
GTCGTTCTGG AACGCGGTCT GTTGACTGAA GCGGAACTTG ACGATATTTT CTCCGTACAG
AATCTGATGC ACCCGGCTTA CAAAGCAAAA CGCTATACTG ATGAAAGCGA ACAGTAA
 
Protein sequence
MSNNIRIEED LLGTREVPAD AYYGVHTLRA IENFYISNNK ISDIPEFVRG MVMVKKAAAM 
ANKELQTIPK SVANAIIAAC DEVLNNGKCM DQFPVDVYQG GAGTSVNMNT NEVLANIGLE
LMGHQKGEYQ YLNPNDHVNK CQSTNDAYPT GFRIAVYSSL IKLVDAINQL REGFERKAVE
FQDILKMGRT QLQDAVPMTL GQEFRAFSIL LKEEVKNIQR TAELLLEVNL GATAIGTGLN
TPKEYSPLAV KKLAEVTGFP CVPAEDLIEA TSDCGAYVMV HGALKRLAVK MSKICNDLRL
LSSGPRAGLN EINLPELQAG SSIMPAKVNP VVPEVVNQVC FKVIGNDTTV TMAAEAGQLQ
LNVMEPVIGQ AMFESVHILT NACYNLLEKC INGITANKEV CEGYVYNSIG IVTYLNPFIG
HHNGDIVGKI CAETGKSVRE VVLERGLLTE AELDDIFSVQ NLMHPAYKAK RYTDESEQ
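
Consistency check (illustrative)

The record's coordinates, lengths, and the two sequences can be cross-checked with a short script. The following is a minimal Python sketch, not part of the source record. The helper names (clean, translate, gene_length) are hypothetical. It assumes the standard codon assignments, which translation table 11 shares for elongation codons; the alternative start codons that table 11 permits are not handled here.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G). Translation table 11
# uses these same assignments for elongation; this is an assumption of the
# sketch, and alternative start codons are not modelled.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence in tens."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


# First row of the gene sequence as listed in the record.
first_row = "ATGTCAAACA ACATTCGTAT CGAAGAAGAT CTGTTGGGTA CCAGGGAAGT TCCAGCTGAT"

print(translate(first_row))                    # MSNNIRIEEDLLGTREVPAD
print(gene_length(4272067, 4273503))           # 1437
print(gene_length(4272067, 4273503) // 3 - 1)  # 478 (one codon is the stop)
```

The translated first row matches the first two blocks of the protein sequence, and the coordinates reproduce the stated gene length of 1437 bp and protein length of 478 aa. Passing the full gene sequence to translate in the same way would allow the whole protein to be compared.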