Gene Information |
Locus tag | B21_00002 |
Symbol | thrA |
ID | 8112805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 336 |
End bp | 2798 |
Gene Length | 2463 bp |
Protein Length | 820 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 644846297 |
Product | hypothetical protein |
Protein accession | YP_002997870 |
Protein GI | 251783566 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0527] Aspartokinases |
TIGRFAM ID | [TIGR00657] aspartate kinase |
| |
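The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative and not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that Protein Length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
# Values copied from the Gene Information table.
START_BP = 336
END_BP = 2798
REPORTED_GENE_LENGTH = 2463      # bp
REPORTED_PROTEIN_LENGTH = 820    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

print("gene length matches:", computed_gene_length == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein_length == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions both reported lengths agree with the coordinates: 2798 - 336 + 1 = 2463 bp, and 2463 / 3 - 1 = 820 aa.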
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.964701 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAGTGT TGAAGTTCGG CGGTACATCA GTGGCAAATG CAGAACGTTT TCTGCGGGTT GCCGATATTC TGGAAAGCAA TGCCAGGCAG GGGCAGGTGG CCACCGTCCT CTCTGCCCCC GCCAAAATCA CCAACCACCT GGTGGCGATG ATTGAAAAAA CCATTAGCGG CCAGGATGCT TTACCCAATA TCAGCGATGC CGAACGTATT TTTGCCGAAC TTTTGACGGG ACTCGCCGCC GCCCAGCCGG GATTCCCGCT GGCGCAATTG AAAACTTTCG TCGATCAGGA ATTTGCCCAA ATAAAACATG TCCTGCATGG CATTAGTTTG TTGGGGCAGT GCCCGGATAG CATCAACGCT GCGCTGATTT GCCGTGGCGA GAAAATGTCG ATCGCCATTA TGGCCGGCGT ATTAGAAGCG CGCGGTCACA ACGTTACCGT TATCGATCCG GTCGAAAAAC TGCTGGCAGT GGGGCATTAC CTCGAATCTA CCGTCGATAT TGCTGAGTCC ACCCGCCGTA TTGCGGCAAG TCGCATTCCG GCTGATCACA TGGTGCTGAT GGCAGGTTTC ACCGCCGGTA ATGAAAAAGG CGAACTGGTG GTACTTGGAC GCAACGGTTC CGACTACTCC GCGGCGGTGC TGGCTGCCTG TTTACGCGCC GATTGTTGCG AGATTTGGAC GGACGTTGAC GGGGTCTATA CCTGCGACCC GCGTCAGGTG CCCGATGCGA GGTTGTTGAA GTCGATGTCC TACCAGGAAG CGATGGAGCT TTCCTACTTC GGCGCTAAAG TTCTTCACCC CCGCACCATT ACCCCCATCG CCCAGTTCCA GATCCCTTGC CTGATTAAAA ATACCGGAAA TCCTCAAGCT CCAGGTACGC TCATTGGTGC CAGCCGTGAT GAAGACGAAT TACCGGTCAA GGGCATTTCC AATCTGAATA ATATGGCAAT GTTCAGCGTT TCCGGCCCGG GGATGAAAGG GATGGTTGGC ATGGCGGCGC GCGTGTTTGC AGCGATGTCA CGCGCCCGTA TTTCCGTGGT GCTGATTACG CAATCATCTT CCGAATACAG TATCAGTTTC TGCGTTCCGC AAAGCGACTG TGTGCGAGCT GAACGGGCAA TGCAGGAAGA GTTCTACCTG GAACTGAAAG AAGGCTTACT GGAGCCGCTG GCGGTGACGG AACGGCTGGC CATTATCTCG GTGGTAGGTG ATGGTATGCG CACCTTGCGT GGGATCTCGG CGAAATTCTT TGCCGCGCTG GCCCGCGCCA ATATCAACAT TGTCGCCATT GCTCAGGGAT CTTCTGAACG CTCAATCTCT GTCGTGGTAA ATAACGATGA TGCGACCACT GGCGTGCGCG TTACTCATCA GATGCTGTTC AATACCGATC AGGTTATCGA AGTGTTTGTG ATTGGCGTCG GTGGCGTTGG CGGTGCGCTG CTGGAGCAAC TGAAGCGTCA ACAAAGCTGG CTGAAGAATA AACATATCGA CTTACGTGTC TGCGGTGTTG CCAACTCGAA GGCACTGCTC ACCAATGTGC ATGGCCTAAA TCTGGAAAAC TGGCAGGAAG AACTGGCGCA AGCCAAAGAG CCGTTTAATC TCGGGCGCTT AATTCGCCTC GTGAAAGAAT ATCATCTGCT GAACCCGGTC ATTGTTGACT GCACTTCCAG CCAGGCAGTG GCGGATCAAT ATGCCGACTT CTTGCGCGAA GGTTTCCACG TTGTCACGCC GAACAAAAAG GCCAACACCT CGTCGATGGA TTACTACCAT CTGTTGCGTC ATGCGGCGGA AAAATCGCGG 
CGTAAATTCC TCTATGACAC CAACGTTGGG GCTGGATTAC CGGTTATTGA GAACCTGCAA AATCTGCTCA ATGCTGGTGA TGAATTGATG AAGTTCTCCG GCATTCTTTC AGGTTCGCTT TCTTATATCT TCGGCAAGTT AGACGAAGGC ATGAGTTTCT CCGAGGCGAC TACTCTGGCG CGGGAAATGG GTTATACCGA ACCGGATCCG CGAGATGATC TTTCTGGTAT GGATGTAGCG CGTAAGCTAT TGATTCTCGC TCGTGAAACG GGACGTGAAC TGGAGCTGGC GGATATTGAA ATTGAACCTG TGCTGCCCGC AGAGTTTAAC GCTGAGGGTG ATGTTGCCGC TTTTATGGCG AATCTGTCAC AGCTCGACGA TCTCTTTGCC GCGCGCGTGG CGAAGGCCCG TGATGAAGGA AAAGTTTTGC GCTATGTTGG CAATATTGAT GAAGATGGTG CCTGCCGCGT GAAGATTGCC GAAGTGGATG GTAATGATCC GCTGTTCAAA GTGAAAAATG GCGAAAACGC CCTGGCCTTT TATAGCCACT ATTATCAGCC GCTGCCGTTG GTGCTGCGCG GATATGGTGC GGGCAATGAC GTTACAGCTG CCGGTGTCTT TGCCGATCTG CTACGTACCC TCTCATGGAA GTTAGGAGTC TGA
|
Protein sequence | MRVLKFGGTS VANAERFLRV ADILESNARQ GQVATVLSAP AKITNHLVAM IEKTISGQDA LPNISDAERI FAELLTGLAA AQPGFPLAQL KTFVDQEFAQ IKHVLHGISL LGQCPDSINA ALICRGEKMS IAIMAGVLEA RGHNVTVIDP VEKLLAVGHY LESTVDIAES TRRIAASRIP ADHMVLMAGF TAGNEKGELV VLGRNGSDYS AAVLAACLRA DCCEIWTDVD GVYTCDPRQV PDARLLKSMS YQEAMELSYF GAKVLHPRTI TPIAQFQIPC LIKNTGNPQA PGTLIGASRD EDELPVKGIS NLNNMAMFSV SGPGMKGMVG MAARVFAAMS RARISVVLIT QSSSEYSISF CVPQSDCVRA ERAMQEEFYL ELKEGLLEPL AVTERLAIIS VVGDGMRTLR GISAKFFAAL ARANINIVAI AQGSSERSIS VVVNNDDATT GVRVTHQMLF NTDQVIEVFV IGVGGVGGAL LEQLKRQQSW LKNKHIDLRV CGVANSKALL TNVHGLNLEN WQEELAQAKE PFNLGRLIRL VKEYHLLNPV IVDCTSSQAV ADQYADFLRE GFHVVTPNKK ANTSSMDYYH LLRHAAEKSR RKFLYDTNVG AGLPVIENLQ NLLNAGDELM KFSGILSGSL SYIFGKLDEG MSFSEATTLA REMGYTEPDP RDDLSGMDVA RKLLILARET GRELELADIE IEPVLPAEFN AEGDVAAFMA NLSQLDDLFA ARVAKARDEG KVLRYVGNID EDGACRVKIA EVDGNDPLFK VKNGENALAF YSHYYQPLPL VLRGYGAGND VTAAGVFADL LRTLSWKLGV
|
| |