Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphamn1_1767 |
Symbol | pyrB |
ID | 6375454 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides BS1 |
Kingdom | Bacteria |
Replicon accession | NC_010831 |
Strand | + |
Start bp | 1912276 |
End bp | 1913325 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 642684260 |
Product | aspartate carbamoyltransferase catalytic subunit |
Protein accession | YP_001960166 |
Protein GI | 189500696 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0540] Aspartate carbamoyltransferase, catalytic chain |
TIGRFAM ID | [TIGR00670] aspartate carbamoyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00166566 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.704946 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAAG TAACCGGTAA ATGTTCACAA ACCATCACCT GCAGGCATCA AATCTTAAAA GATTTTGTAC ATTTTCTGAA CCAAACAACC ATCACTGAAT TTATCTGGAA GACTGCCGCA TTGAAACATC TTACTGGATT ATCAGGTATA TCCCCGGCGA CAATTACCGG AATTCTTGAC AAAGCCGCCC TACATAAAGA TCTTTTCCTG CATGCTGAAA ACAAGATTCC CCGCACACTT CAGGGAAAAC GTATCGTTCT CGCCTTTTTT GAGAACTCTA CCCGGACAAG ATTTTCTTTT GAAATAGCGG CACGAAATCT CGGCGCTTCC ACGCTCAATT TCAGCGCGTC ATCGAGCAGC GTCAGCAAGG GCGAAAGCAT CGTGGATACC ATAAAAAATC TGGAGGCTAT GCAGGTCGAT GCTTTTGTCA TCCGCCATCC GTCATCCGGC TCGGCCGAAC AGATAAGCCG TATCACCGAC AAACATGTCA TTAACGCCGG TGACGGCACT CATGAACATC CTACCCAGGC TCTGCTCGAC ATCTTTACCC TGAGGGATTA TTTCGGGTCA CTTCAGGATA CCAGGATCAT GATTCTGGGA GATATACTGC ACAGCCGTGT CGCCCGTTCA AATATTTTCG GACTGACCGC GCTGGGGGCA AATGTCGGCG TCTGCAGCCC GGTGTCGCTT CTTCCGGCAG ATATATCATC TCTCGGTGTC AGGATATTTA CAGGAATCGA TGACGCCATC AGGTGGGCTG ACGCCGCGAT TGTGCTGAGA CTCCAGCTTG AAAGAACAAC AGGGGGCTAT CTGCCCTCCC TTGAAGACTA CTCGCTTCAT TTCGGCCTTA CCGATGAACG CCTCGAAAAA ATACGCAAAC ACATGCTTGT TCTTCACCCC GGACCGATTA ACAGGGAAAT TGAAATATCG AGCAACGTGG CTGACCGTAT CCAGCCTCCC GGTTATTCGA AAAGCGTACT GCTCGAACAG GTAACCAACG GCGTGGCCGT CCGAACCGCA GTTCTTGAAA TGCTCTTCAC TGAAACATAA
|
Protein sequence | MLKVTGKCSQ TITCRHQILK DFVHFLNQTT ITEFIWKTAA LKHLTGLSGI SPATITGILD KAALHKDLFL HAENKIPRTL QGKRIVLAFF ENSTRTRFSF EIAARNLGAS TLNFSASSSS VSKGESIVDT IKNLEAMQVD AFVIRHPSSG SAEQISRITD KHVINAGDGT HEHPTQALLD IFTLRDYFGS LQDTRIMILG DILHSRVARS NIFGLTALGA NVGVCSPVSL LPADISSLGV RIFTGIDDAI RWADAAIVLR LQLERTTGGY LPSLEDYSLH FGLTDERLEK IRKHMLVLHP GPINREIEIS SNVADRIQPP GYSKSVLLEQ VTNGVAVRTA VLEMLFTET
|
| |