Gene Information |
Locus tag | Tpen_0295 |
Symbol | |
ID | 4601304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 262187 |
End bp | 263464 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639773053 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_919708 |
Protein GI | 119719213 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
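The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched in a few lines. This is a minimal check, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 262187
END_BP = 263464


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


computed_gene_len = gene_length(START_BP, END_BP)
computed_protein_len = protein_length(computed_gene_len)
print(computed_gene_len, "bp;", computed_protein_len, "aa")
```

Under these assumptions the computed values match the Gene Length (1278 bp) and Protein Length (425 aa) rows.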
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00215274 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGCCG GAAATACTGT AGCAGGTAAA AGCAGTTTAG CAAGGGTGCT TGCATACATC GTGATAGCTT TACTTATAGG CGCCTTCCTG GGCTACTTCC TGCGAGGATA CCCGGCACAG CAACAGGCGG CTACCACGCA AACATCGGTA ACAACGATAC CTATAGGAGC GCTCGTTGAG CTTTCTGGTG ATCTCTCGTC TTACGGTAAG AGAGACGAGC TAGCAATGCA GATAGCCATC GAAGACGTTA ACAACTTCGC CGAGAAGATA GGCTCGCCGT ACAGGTTCAA GTTGCTCGTA GAGGACTCCG GAACTAGCCC GGAGCAAGCC CTCTCGAGGA TTAAGACTCT GGCCGCGCAG GGTGTACGCG CAGTTATAGG GCTAGAGGCC AGTAGCGAGG TAGCAGCAGT GAAGCAGTTC GCTGACACGA ACCACGTAGT CGTTTTAAGC GTAGGATCGA CAGCTTTGTC GCTGGCAATC CCCGGCGACT ACATACTAAG AGTTGTACCA CCAGACAGCG TGCAAAGCAA GGCTCTGGCC CGCCTTATCT ACTCGCTGGG ATACCGGAAC GTGGCGGTGA TATACCGCAA CGACGCATGG GGTGTAGGTC TCTTTGAAGG GTTCAGCGCG AGGTTCAAGG AGCTGGGAGG GAACGTGGCG GGGGTTGCGT ACGACCCTGC GGCTAAGGAT CTCAGCGGTG AGGTGAACAG GCTCGCGGAC ATAGCGGCTA GCATGGGGTC TAACACGGCG GTGCTCGCCA TAACGTTCGA GGACGACGGC ATACAGATAG TGAAGCTAGC GGCCAGGAAC CCTGTCCTCT CCAAGCTCAA GTGGTTCGGC ACAGACGGCG TGGCTCAGTC GACCAAGCTT GCAAGCGAAG CTGGGGAAGA GCTAATAGCT CTGGGAGGCT TTCCGTGCAC GATATTCCAG CCTTCCGAGA ACCAGCGTCT AGCAGACTTC GTGAACAGAT TCCGTAGTAG GAGCGGGGGC GAGGATCCAC ACGCGTACGC CATGAACGCC TACGACGCAG TCTGGCTCGT AGCCCTATCG GTGATGCTTA CAGGCTCCTA CTCGGGAGAC AAGCTACTAA GCACTATTCC ACTCGTCGCC CAGAACTTCA ACGGAATCAC GGGACCGCTC ACGCTGGACG CAAACGGAGA CAGAGCTTCG GGAGACTACG CTATATGGCG CGTCGTTAAA ACAGCAAACG GCTACGACTG GCAGATAATA GGATGGTACA GCGCGTCCTC CGATAGCGTT ACAATCCAGG GAGGCTAA
|
Protein sequence | MSAGNTVAGK SSLARVLAYI VIALLIGAFL GYFLRGYPAQ QQAATTQTSV TTIPIGALVE LSGDLSSYGK RDELAMQIAI EDVNNFAEKI GSPYRFKLLV EDSGTSPEQA LSRIKTLAAQ GVRAVIGLEA SSEVAAVKQF ADTNHVVVLS VGSTALSLAI PGDYILRVVP PDSVQSKALA RLIYSLGYRN VAVIYRNDAW GVGLFEGFSA RFKELGGNVA GVAYDPAAKD LSGEVNRLAD IAASMGSNTA VLAITFEDDG IQIVKLAARN PVLSKLKWFG TDGVAQSTKL ASEAGEELIA LGGFPCTIFQ PSENQRLADF VNRFRSRSGG EDPHAYAMNA YDAVWLVALS VMLTGSYSGD KLLSTIPLVA QNFNGITGPL TLDANGDRAS GDYAIWRVVK TANGYDWQII GWYSASSDSV TIQGG
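The sequence fields are printed in space-separated blocks of ten characters, so they need light cleaning before any computation. The following is a minimal sketch of how the Gene sequence value could be normalized and its GC percentage computed; the helper names are illustrative, and the two short strings are only the first and last blocks of the gene sequence above, not the full record, so the printed percentage is for that excerpt only.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq_field: str) -> str:
    """Remove the block spacing and normalize to upper case."""
    return "".join(seq_field.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a standard stop codon."""
    return seq[-3:] in STOP_CODONS


# Excerpts only: the first two blocks and the final block of the gene sequence.
head = clean("ATGTCTGCCG GAAATACTGT")
tail = clean("GAGGCTAA")

print(head.startswith("ATG"))   # start codon check on the excerpt
print(ends_with_stop(tail))     # stop codon check on the excerpt
print(gc_percent(head))         # GC percentage of the excerpt, not the whole gene
```

Passing the full Gene sequence value through `clean` and `gc_percent` would give a figure to compare against the GC content row.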