Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0509 |
Symbol | |
ID | 4601006 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 462868 |
End bp | 463797 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 639773277 |
Product | putative agmatinase |
Protein accession | YP_919918 |
Protein GI | 119719423 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family |
TIGRFAM ID | [TIGR01230] agmatinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATGGAG CATTGGAGCT TTACTCGCGG AGTGTGTCTC CAAAGCTCTT CGGAAACGAG TGTAGCTTTG AAGAGAGTGC TTACGTCGTC GTGGGTGCTC CTTTTGACTC AACGGCGACC GGTGTACCCG GACAAAGGTT TGCGCCTCAA AGAATACGTG AAGTCTCGGT AGAGCTCGAA ACCTTTCTTC CCGACTTGCA AATAGATGTA GAGCGCCTCG CCGTCAACGA TGCTGGAGAC CTACCCGTAT TAACCAGCGT AGAAGCCTTG ATAGATATCC TGGGCGGAGT CGCCAGGGAA GTCTTGTCGT CCGGGAAAGG GCTCATCATG CTCGGTGGGG ACCATTTCTC GTCCTTCCCC GTCCTCGTTG AAGCAGCAGG CAACGTGGAC GAACTAGGCG TGCTCGTCTT CGATGCCCAC CTAGATCTTC GAGACGAATA CCCCGTCAAG TGCAGGTACA GCCATGCTAC GGTCTTCCGC CGGTTAATCG AACAAACGAA AAACGTATAC ATCGCGTACT ACAAGCCGCG CGGATGGAGC AGCGAAGAGT ACGAATTCAT GCGTCAATCG CCCAGGAAGC TCGCCGTTCT AAACACCCGG GAAGAGGTAG AAAGCTGGGT AAAGGAGCAC AGGACGGTAT ACGTATCTAT CGATATCGAT GCATTGGACC CCGCGTATGC CCCCGGGACG GGTACTCCTG AACCTATCGG CCTCGCTCCC AGGGAGCTCG TTGACTCCCT ACGCGCGGCG GTTCTCGGCT CAGAAAAACT CCTAGGAATA GACGTAGTAG AGGTTAACCC GCTAGTGGAT GTTAACAACT TGACGTCAAG GGTTGCCGCG AAAATACTAC TCGAAGCACT CGCAGCCATG TACGAGGCTA GGAAGATAGG CGCCTGCACG CGAACTTCTA AACACTTCTT AAACGGTTAA
|
Protein sequence | MYGALELYSR SVSPKLFGNE CSFEESAYVV VGAPFDSTAT GVPGQRFAPQ RIREVSVELE TFLPDLQIDV ERLAVNDAGD LPVLTSVEAL IDILGGVARE VLSSGKGLIM LGGDHFSSFP VLVEAAGNVD ELGVLVFDAH LDLRDEYPVK CRYSHATVFR RLIEQTKNVY IAYYKPRGWS SEEYEFMRQS PRKLAVLNTR EEVESWVKEH RTVYVSIDID ALDPAYAPGT GTPEPIGLAP RELVDSLRAA VLGSEKLLGI DVVEVNPLVD VNNLTSRVAA KILLEALAAM YEARKIGACT RTSKHFLNG
|
| |