Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_0299 |
Symbol | thrA |
ID | 6373954 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 291552 |
End bp | 294011 |
Gene Length | 2460 bp |
Protein Length | 819 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642682813 |
Product | bifunctional aspartokinase I/homeserine dehydrogenase I |
Protein accession | YP_001958749 |
Protein GI | 189499279 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0460] Homoserine dehydrogenase [COG0527] Aspartokinases |
TIGRFAM ID | [TIGR00656] aspartate kinase, monofunctional class [TIGR00657] aspartate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0106058 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTTC TAAAATTTGG CGGCACCTCG ATAGAGAATA GTGAAAGGAT CAGAAATGTC CTGGGTATTA TTCGCGGCGC GATAAAGGAT TCACCTGTTA TCATTGTAGT GTCCGCTATA CGAAAGGTTA CAGATCTGCT TTTGGAAGCC GCGGTTGCCG CGGGAAGCGG CGATGCCGGT TACAGGGAAA AGCTTGTTAC TATTGAAAAT ATTCACGGAG ATCTTGTTCG TGATCTGATT GACCTCTCCC GCAGAAACGA GGTTCAGGAA GTGTTGACCG ATGAACTTCA GGAGCTTGGT GATATACTCT ATGGTGTCAG CCTTCTTCGT GACCTCTCTG ACCGGAGCAA GGCGCTGATT CTGAGTTTCG GTGAACGCTT TTCAGCAAGG ATTATCAGCA CCTTCTTCTG TCAGGAAGGC CTGGACGCTT CATATACCGA TGCGAGAAAG TTGATTGTTA CCGACACCAA TCATTGTGAT GCGCGGGTTG ATATGAGCGC TTCTTCGGAA CTGATCAGCG CATGGTTCAA AGAGGAGCGC GGAGTGCCTG TCGTGACGGG TTATATCGGT GCCGCTCCTG ATGGAACGGC GACAACCCTT GGAAGGGGGG GATCAGACTA CACCGCTACC ATACTCGGTT CCGTTGCCGG TGCTGACGAA ATCCAGATAT GGACCGACGT GGATGGCTTT TTCAGCGCAG ATCCTAAACG GGTCAAGGAC GCCTATGTTC TGCCGTTTAT CAGCTATGGC GAGGCGATGG AGCTTTCCCA TTCGGGAGCC AAAGTGCTGC ATCCATATTC AGTTCATCCG GCCATGAAGA AGGGAATTCC GATCACTATC AGGAATTCCT ACAATCCTGA CGTAGAAGGA ACACGTATCT CCGCGCCTGA AGGAAACGAT ACAGGCTCCG GAAAGCCGGT GACCGGCCTC AGCTCAATCA ATGACGTTGT GCTGCTGAAC TTTTCCGGGA GCGGCATGGT TGGAGTGCCG GGTATTGCGT CAAGGCTGTT CAGCTGCCTG GCACGTCACA AAATCAATAT AATTTTTATT TCACAGGCCT CTTCAGAGCA GTCCATAAGT CTCGCCATCA ATCTTGTCCA GGCGGAAAAA GCACGTCTTC TTCTCGAGCA GGAGTTCGCG GCTGAACTTG CGGTGCGTCA GATTGAATCC CTGACATTCC GAAAGCATAT CGCCATCATC GCTGTTGTCG GCAAGCAGAT GCCGGGGCAT CCGGGTGTTT CGGCCCATCT TTTCGAGACG CTCGGAAAAA ACGGTATCAA TGTCATAGCC GTGGCTCAGG GGGCGAATGA GATGAATATC TCATTCGTGA TCGACAGCCA TGATGAAGAC AAGGCGCTTC ATTGTGTGCA CGAGTCGTTT CTGCTCTCCC GCCGGAAGGT GCATGTTTTT ATCGCGGGAA CCGGAACTAT CGCAAAAAGC CTTATCGGGC AGATTCGTGA TCACAGCCTT ACGCTGAGAA AGGAGAAGGA GCTGGATGTG GTCGTAAGCG GCATGGCGAA TACCCGGATG CATGTGAGCG ATGATGCCGG CATAGATCTG AGCCGATGGG AGAGCGGTCT GAAGCCGAGG ACCGATGGAA AAACGGTGAG CGACTATGTA GACTATATCA AGTCACGGAA TCTGCATAAT ACCATATTTG TGGACTGTAC GGCAAGTGCG GAGGTTGCTG CATGCTATCC TGATCTTCTT GCCTCGAACA TCTCTGTCGT AACGGCAAAC AAGCTTGGAA CGGCAGGTTC ATGGGAACTC TATGAGACCA TATCCGAGGC GCTGCATGCC TCAAATGCCC GTTTTCTCTA TGAAACCAAT GTGGGCGCGG GACTTCCCAT CATCAATACA CTGAACGATC TGAGAAACAG CGGTGACAGG ATCGTCAGGA TTGAGGGAGT GCTTTCAGGG ACACTGAGTT ACATATTCAA TGAACTTCGC AAAGGGCGGA AATTCAGTGA GATCGTCAGG AGCGCAAGAG ATGCCGGCTA CACTGAGCCT GACCCTCGAG AAGATCTTTC CGGTGCTGAT TTTGCGAGAA AGTTTCTCAT TCTCGGCAGG GAACTCGGTT ACAGGCTTGA TTATGAGGAT ATCGAGTGTG AGAGTCTTGT TCCGGAATCT CTGAGGGGAG AGATGAGTGT TGAAGAGTTC ATGGAAAGGC TGGGCGGTAT CGACGCTGCA TATCAAACCA GAATCAGTGA GGCCGCTGAA ACGGGCATGA CGATTGCATA CGCGGGTGAA ATCAGTGAAG GTAAAGCGCG TATCGGTGTA AAAACGCTGC CTGTATCGAA TCCTGTCGCG GGTTTGAACG GCACGGAAAA TCTGGTGGTG TTTACCACGG ACCGCTATTT GGATACTCCG CTTGTGGTCA AAGGGCCTGG TGCAGGAGGA GAGGTTACCG CAGGAGGCGT GTTTGCCGAT ATTCTGCGCA TTGCAAGCTA TCTTATATAG
|
Protein sequence | MKVLKFGGTS IENSERIRNV LGIIRGAIKD SPVIIVVSAI RKVTDLLLEA AVAAGSGDAG YREKLVTIEN IHGDLVRDLI DLSRRNEVQE VLTDELQELG DILYGVSLLR DLSDRSKALI LSFGERFSAR IISTFFCQEG LDASYTDARK LIVTDTNHCD ARVDMSASSE LISAWFKEER GVPVVTGYIG AAPDGTATTL GRGGSDYTAT ILGSVAGADE IQIWTDVDGF FSADPKRVKD AYVLPFISYG EAMELSHSGA KVLHPYSVHP AMKKGIPITI RNSYNPDVEG TRISAPEGND TGSGKPVTGL SSINDVVLLN FSGSGMVGVP GIASRLFSCL ARHKINIIFI SQASSEQSIS LAINLVQAEK ARLLLEQEFA AELAVRQIES LTFRKHIAII AVVGKQMPGH PGVSAHLFET LGKNGINVIA VAQGANEMNI SFVIDSHDED KALHCVHESF LLSRRKVHVF IAGTGTIAKS LIGQIRDHSL TLRKEKELDV VVSGMANTRM HVSDDAGIDL SRWESGLKPR TDGKTVSDYV DYIKSRNLHN TIFVDCTASA EVAACYPDLL ASNISVVTAN KLGTAGSWEL YETISEALHA SNARFLYETN VGAGLPIINT LNDLRNSGDR IVRIEGVLSG TLSYIFNELR KGRKFSEIVR SARDAGYTEP DPREDLSGAD FARKFLILGR ELGYRLDYED IECESLVPES LRGEMSVEEF MERLGGIDAA YQTRISEAAE TGMTIAYAGE ISEGKARIGV KTLPVSNPVA GLNGTENLVV FTTDRYLDTP LVVKGPGAGG EVTAGGVFAD ILRIASYLI
|
| |