Gene Information |
Locus tag | Tpen_1043 |
Symbol | |
ID | 4600948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 982898 |
End bp | 984220 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639773821 |
Product | major facilitator transporter |
Protein accession | YP_920446 |
Protein GI | 119719951 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
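The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a stop codon; the function name `cds_lengths` and its parameters are hypothetical and not part of the record or any database API.

```python
def cds_lengths(start_bp: int, end_bp: int, has_stop_codon: bool = True) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates (an assumption about this record).
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    # The terminal stop codon is not translated into an amino acid.
    protein_len = codons - 1 if has_stop_codon else codons
    return gene_len, protein_len


# Values taken from the "Start bp" and "End bp" fields of this record.
gene_len, protein_len = cds_lengths(982898, 984220)
print(gene_len, protein_len)  # 1323 440
```

Under these assumptions the result agrees with the "Gene Length" (1323 bp) and "Protein Length" (440 aa) fields.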
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTACACG AGGCGCGCGC AGCGTGTAAA AAAGTTTTAA TTCCCGAGGG TTTTTCTCGT TTCCCCGTGG GTGGATGGCT GGGGAGGGTT AGGGAGTCCC TTGGAAGGGT TCCTAGGAAC ATCAAGGTGC TAGCGCTAGG CTGGCTCGTC TGGTCGCCGG TACAGGCGAT GGCGGGGCCG TACACGCAGC TCTACGTTAG CCGCCTGGGG GCATCTCCCG AGGACATCTC GCTGGTGCAG TCAGCAACGC AAGTGGCGAA CGCGCTTTCA AGGATAGTCG GCGGCTTCCT CTCGGACAGG TACGGTAGGA AGAGGGTTCT GTGGGTGGGG ACGTTCCTCG TAGCGCTCGC ATACCTCTTG ATGAGCGTTG CAACCGACTG GCGGAGCTAC GCCATTGCCA GCGTGCTCAA CGGCTTCGCC CTCTTTTACC AGCCTGCGCT CGAAGGGATA CAGGCGGACT CGGTTCCCTT GCACCTCAGG GGGAGGATGA ACGCACTCCT ACACCTGGTG CCGGGCCTGG CATCCTCGCT GTCCCCGCTC GGCGGAGCCG CATTGGTAAA TGCTTACGGG CTCGTCGGCG GCGTGAGGGT GATATTCTTC CTCTCGTTCG TTACCGGCGT AGCCATCGCG GTTGCAAGGC TACTATGGAT AGAGGAGACC CTCGAGCCGA GGGGCTCCGG GGTAAACATG CTCGAATCCT ACGTGGACGC CTTGAGGCAC GTGTCCCGCG ATGTCTACAC GCTGATAACG CTGGACACCC TCTTCAACCT CGTGGGCGCG ATGTCCTTCC TCTCGAACTA CTACATGTAC TACTACCTCG GGGTCGACAA GAGCGAGCTA GCCATGCTGG CCTCCCTGGG AAGCCTCGTG AACCTAGGCC TGCTGATACC CGCCGGGAGG GCTGTGGACA CCAGGGGGAG GAACTTCTCG ATAACCCTGG GCTTCCTGAT GGGCACGCTG AGCCAGCTAT TCTTCGTCCT CTCCCCGCCA TCCTCCAGCT TCACGCTACC GGCACTCATC GTCTCCACGC TTTTCGGAGC GGTGGGAGGA GCCTTCTACG GGCTCGCGTA CTCGTCTCTC AGGGCGGACC TCGTAGCGAA AGAGTACAGG GGGAGGATCT ACGCCCTCTG GGGGCTCGCG CCGGCGGCGA GCTGGAGCCT AGGCGCGTAC ATAGGGGGGT GGATGTACAG CAACCTGGGG CCCCAAACCC CCTTCGTTGC CAGCTTCATG CTCAGAGTCC TGCTCACCCC GCTCGCCCTC ACCCTCTTCG GGAAGCTCAC GAGGAAAGTC GACCTAGCGT TGCGAAACGT CGAGGGAAGC TAA
|
Protein sequence | MLHEARAACK KVLIPEGFSR FPVGGWLGRV RESLGRVPRN IKVLALGWLV WSPVQAMAGP YTQLYVSRLG ASPEDISLVQ SATQVANALS RIVGGFLSDR YGRKRVLWVG TFLVALAYLL MSVATDWRSY AIASVLNGFA LFYQPALEGI QADSVPLHLR GRMNALLHLV PGLASSLSPL GGAALVNAYG LVGGVRVIFF LSFVTGVAIA VARLLWIEET LEPRGSGVNM LESYVDALRH VSRDVYTLIT LDTLFNLVGA MSFLSNYYMY YYLGVDKSEL AMLASLGSLV NLGLLIPAGR AVDTRGRNFS ITLGFLMGTL SQLFFVLSPP SSSFTLPALI VSTLFGAVGG AFYGLAYSSL RADLVAKEYR GRIYALWGLA PAASWSLGAY IGGWMYSNLG PQTPFVASFM LRVLLTPLAL TLFGKLTRKV DLALRNVEGS
|
| |
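The sequences above are shown in space-separated blocks of ten characters. The following is a minimal sketch of how such a blocked string might be normalized before checking its length or GC fraction; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, copied from the record.
first_blocks = "GTGCTACACG AGGCGCGCGC"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))  # 20 0.75
```

Applied to the full gene sequence, the same helpers could be used to compare the cleaned length against the "Gene Length" field and the GC fraction against the "GC content" field.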