Gene Information |
Locus tag | Tpen_1576 |
Symbol | |
ID | 4600562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1528285 |
End bp | 1529568 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639774349 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_920974 |
Protein GI | 119720479 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
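
The length fields above can be cross-checked from the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the last codon of this unspliced CDS is a stop codon. Both assumptions are consistent with the values listed here but are not stated by the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1528285, 1529568)
print(length, protein_length_aa(length))  # 1284 427
```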
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0597639 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCAA CGCAATCTCA GCAAAAGGGA AAATTCGCCT TTGACGTAAA GCTCGTAGCA GTCGCGATAA TCGCCCTCAT AGTCGGAGTA GCCATAGGCG CGGCGATCTT CGGAGGAGCG GCGGGGCAGG CCCCCGGCGC GGCGCAACAG GCTCCCAAGA AAGTCTACAC TATAGGCTTC ACGCTCCCGC TGACCGGCGA GCTCTCGTCC ATCGGGAAAA TTTGGGAAAA AGTGGTCTAC CTGGCAATAG ACGACCTGAA CAAGGAGGCG CAGGCTTACG GGTTTAACGT AGAGTTCAAG GCCGTGATAC TCGACGACGG GACTACGCCC GAGAAGGCCC TCCAAAACGT CCAGACGCTC GCACAGCAAG GAATAAAGGT CATAATAGGA CCCGCCGCAA GCTCGCAGGT TAAGGCCGTG AAAGGATTCG TCGATAGCAA CCAGATAGTC TTAATCTCGC CGTCCTCCAC TGCTCCCACG CTCGCAATAC CCGGCGACTT CATCTTCAGA ACCGTCGGGT CCGACGCCGG GCAGGCAAGA GTGCTCGCAA CGCTCGCCTA TCAGGAGGGC GCCAGGAAGG TCATAGTGTT CCACAGGAAC GACGAGTACG GTAACGCGTT CGCGGACTTC TTCAAGAAGT ACTTCTCCGA GCTTGGCGGA AGCTCCATTG ACGTTCCCTA CCAGACCGGC CTCTCCGACT ACGCCGCAGA GGTCGCCAGC CTCGCCAGTA AGGTGCAGTC CGAGAAAGTA GACGCCGTCG TGCTGATATC GTTCGACACG GACGGCGGCA ACATACTGTC GCACGCCGCC GAGTCGCCAG TCCTCTCCAG CGTGAGGTGG TTCGTCTCCG AGGGTCCCCA CGGAGCCGCA GAGCTCAAAG CCCCGGCTGT CGGGGCTTTC GCCGCTAAGA CGAAGTTGCT GGGGACACGC CCACTCTTCA TAGGCAACCC GCTCTACGAG GACTTCAAGA AGAGACTTAA GGAGAAGTAC GGGGTGGACG CCTCCGTGTT CTGCGATACG CTCTACGATG CAGTTATGCT CGCGGGCTGG GCTATGCTGA GGGCTGGTAG TAGCGATGGC AACGCTATAC GGAGCGCGCT GATCGAAGTC GCGAAGCACT ACTACGGAGT GAGCGGCTGG GCAATATTCG ACGAGGCGGG CGACAAAGCG TACCAGGACT ACGGAGTATG GGCGATAGTC AAGACCGACG GGGGCTACGA CTTCAAAGAC GTAGGGGTAT ACGAGAGAGG CTCCATAGTG TTTACGGCTA AACCGTACCC GTAA
|
Protein sequence | MSATQSQQKG KFAFDVKLVA VAIIALIVGV AIGAAIFGGA AGQAPGAAQQ APKKVYTIGF TLPLTGELSS IGKIWEKVVY LAIDDLNKEA QAYGFNVEFK AVILDDGTTP EKALQNVQTL AQQGIKVIIG PAASSQVKAV KGFVDSNQIV LISPSSTAPT LAIPGDFIFR TVGSDAGQAR VLATLAYQEG ARKVIVFHRN DEYGNAFADF FKKYFSELGG SSIDVPYQTG LSDYAAEVAS LASKVQSEKV DAVVLISFDT DGGNILSHAA ESPVLSSVRW FVSEGPHGAA ELKAPAVGAF AAKTKLLGTR PLFIGNPLYE DFKKRLKEKY GVDASVFCDT LYDAVMLAGW AMLRAGSSDG NAIRSALIEV AKHYYGVSGW AIFDEAGDKA YQDYGVWAIV KTDGGYDFKD VGVYERGSIV FTAKPYP
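
The relationship between the two sequence fields can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: it assumes the space-separated blocks of ten are display formatting only, and it uses the codon assignments of translation table 11, which are the same as those of the standard code (table 11 differs only in its permitted start codons, which this sketch ignores). The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Only A, C, G and T are handled; any other symbol raises KeyError.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the Gene sequence field above.
first_blocks = "ATGAGTGCAA CGCAATCTCA GCAAAAGGGA"
print(translate(first_blocks))  # MSATQSQQKG
```

Passing the full Gene sequence field to `translate` and `gc_fraction` would allow a comparison with the Protein sequence and GC content fields.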