Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1157 |
Symbol | pyrB |
ID | 4602159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1097173 |
End bp | 1098120 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639773933 |
Product | aspartate carbamoyltransferase catalytic subunit |
Protein accession | YP_920558 |
Protein GI | 119720063 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0540] Aspartate carbamoyltransferase, catalytic chain |
TIGRFAM ID | [TIGR00670] aspartate carbamoyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.111076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTTTCGG GTTTAGGGTC TCGTGGGAAC CCCTTCTACG GTAGGGACGT CCTGTCGATA CTCGACTTCT CGAGGAGCGA CCTCGAGTAC CTCTTCGCGG AGGCGGACAG GGTTCGGCGC GACCCCTCGG CGTTCAGCGG GGAGCTGAGG GGCTACGTGT TGGCGACTGC CTTCTTCGAG CCCAGCACCA GGACGAGGCT GAGCTTCCAG GCGGCGATGC TGAGGCTCGG CGGCTCCTGC ATAGACCTGG GCGAGCTCGA GAAGAGCTCT ATAGCTAAGG GGGAGAACTT CGCGGATACC GTGAGGATGC TCGACGCCTA CGCGGACGTC ATAGTCGTTA GGCACAGGCT TGAGGGGGCG GCGAGGTTCG CGGCGGAGGT GGCGGAGAAG CCGGTTATAA ACGCCGGCGA CGGAAAGAGG CACCACCCCA CGCAGGCCAT GCTCGACCTG TACTCCGTCA AGACGCTGAA GGGCTCTGTG GACGGGCTGG TCTACGGGGT TCTCGGGGAC TTGAAGTACG GGAGGGCTGC CGCGAGCTTC ATCCTGGGGC TTTCCCTGTT CAAGCCGAGG AAGGTCTACC TTATATCGCC GGGGCTTCTC AAGGCGCGGG AAGACGTGAA GGAGGCCTTG AGGGAGAGGG GTGTGGGCTT CGAGGAGGTA GAGTCCCCGT CGGAGGTGAT CGGCGAGCTC GACGTGCTCT ACGTTACGAG GATCCAGCGG GAGCGCTTCC CGGACCCCTC CGAGTACGAG AAGGTCAGGG GTAGCTACGT CGTGGACTCG AAGTTGCTGA GGAACGCTAA GGAGGGCTTG ATCGTGCTCC ACCCGCTTCC ACGCGTAGAC GAGATCTCGT TCGACGTCGA CGGGACTCCT CACGCCAAGT ACTTCGAGCA GGCAAGGCTG GGCATCCCCC TGAGGATGGC TTTGCTGAAG CTCGTCTTGA AGGGGTGA
|
Protein sequence | MVSGLGSRGN PFYGRDVLSI LDFSRSDLEY LFAEADRVRR DPSAFSGELR GYVLATAFFE PSTRTRLSFQ AAMLRLGGSC IDLGELEKSS IAKGENFADT VRMLDAYADV IVVRHRLEGA ARFAAEVAEK PVINAGDGKR HHPTQAMLDL YSVKTLKGSV DGLVYGVLGD LKYGRAAASF ILGLSLFKPR KVYLISPGLL KAREDVKEAL RERGVGFEEV ESPSEVIGEL DVLYVTRIQR ERFPDPSEYE KVRGSYVVDS KLLRNAKEGL IVLHPLPRVD EISFDVDGTP HAKYFEQARL GIPLRMALLK LVLKG
|
| |