Gene Information |
Locus tag | Tpen_1385 |
Symbol | |
ID | 4600668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1339670 |
End bp | 1340887 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639774160 |
Product | major facilitator transporter |
Protein accession | YP_920785 |
Protein GI | 119720290 |
COG category | |
COG ID | |
TIGRFAM ID | |
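The length fields in this table are consistent with the listed coordinates. A minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the CDS length includes one stop codon (both are assumptions that match the values above, not statements made by the record); the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1339670, 1340887)
print(length, protein_length(length))  # 1218 405
```

Because the gene is not spliced and not a pseudogene, the simple "codons minus stop" rule applies; a spliced or frameshifted gene would need its exon structure instead.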
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0632886 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCGG AGAGGAGGTT CAGGTTCGCG GTCTCCTACG TACCCGTCTT CGTTGCCAGG ATAGGGTCCG GCGCGAACAC GTTCCTGGTA GCCATGCTCG CGAGCGGCGG GGACGTCGCG GCGGGCTTCG TTATGGCGGC GTACCCGCTC ATGGAGGCTA TAGGCGCCCT ACTCGCCGGC TCGTGGTCCG ACTTCGCGGG GAGGAAGCGC ACGCTGATAG CAGGCTACGT GGCCAGGTCG GCCGGGATGC TCTCCCTGGC GGCGGTATTC GCGCTTTCGA GCTCTTCCCC CGGGAGCCTT GTGTTCGCCG CGCTGGAGGC CGCCTTGAAC GGCGTCGTCG GCTTTACAAC AGCCCTGATA CTCGTCTCTT CGCTCTCGAT GGCCACCGAC CTCACCGAGA CGGGGAACAG GGGGCTTGGG ATGGGGGGCT TCGAGTTCGT GAACCTCGCG AGCTACGGCG CCGGGTACCT CCTGGGCTCG CTTCTCTACA GCGTGCTCCC CGGCTACCCC GCCTACCTAG CCGTGGCGCT CCTCACGGCC CTGGCGACCC CGGTCTTCGC GGCGTTCCTC GAGGAGACGA GGCCCCCCGT CCCCCTGGTG AGGAGGGGTC TTCTCTCCAG CCTACCGAGG TCGGCGCTCG TACTGCTACC CGTCTGGGTG GCGCTCACGA CGATAATAGG CATCGGTATC TACGCGCCCA GGGTTATCCA CGACCACCTA GGCGCGGTGC ACGGCCCAGC AATCGGTAAG GCCGGGATAG GCCTGCTGTT CCTCGGCGCC CTCGTGCTCC TCGGCTCCGG CGCGGTGTTC TTCGGGAGGC TCTCGGACTC CTGGGGGCGC GTCAAGGTAT TCAGGCTCGG CCTAGCCGGG GGCCTCGCGG CGCTTTCAGC CATGAGCGTC CTCCTCGCCC TCGGCGTAGA CATCCTCAGG GCTGCCGCCG CCGTAGCCCC CCTGATGTTC CTCACGTCCG CCGTAGGGCC CACGATACTC GCGCTGGTCG GGGACCAAGC GTCCACGGAT GCCCGCGGCA GGGTGATGGG CGTGTACAGC GTGGTCCTAG GGCTCGGGAT GGGGCTCGGG AGCATACTGG CCGGGCTCGC CTCCAGCGCC CTCCCGTACA ACAGGCCCCT TGCCCTCTCG CTCGTAGCGC TCGCGGCTTA CTCGGTGGCC GCGCTCGCGC ACCTCTACCT GGAGAGGCGG CTCGGCGGCT CCCTGTAG
|
Protein sequence | MSAERRFRFA VSYVPVFVAR IGSGANTFLV AMLASGGDVA AGFVMAAYPL MEAIGALLAG SWSDFAGRKR TLIAGYVARS AGMLSLAAVF ALSSSSPGSL VFAALEAALN GVVGFTTALI LVSSLSMATD LTETGNRGLG MGGFEFVNLA SYGAGYLLGS LLYSVLPGYP AYLAVALLTA LATPVFAAFL EETRPPVPLV RRGLLSSLPR SALVLLPVWV ALTTIIGIGI YAPRVIHDHL GAVHGPAIGK AGIGLLFLGA LVLLGSGAVF FGRLSDSWGR VKVFRLGLAG GLAALSAMSV LLALGVDILR AAAAVAPLMF LTSAVGPTIL ALVGDQASTD ARGRVMGVYS VVLGLGMGLG SILAGLASSA LPYNRPLALS LVALAAYSVA ALAHLYLERR LGGSL
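The gene sequence as displayed begins with ATG, ends with TAG, and its first codons match the start of the protein sequence, so it appears to be given 5' to 3' on the coding strand even though the gene lies on the minus strand. The sketch below shows one way to strip the 10-base spacing, translate the sequence, and compute GC content. It is an illustration under stated assumptions, not the database's own method: it uses the standard amino-acid assignments (which translation table 11 shares with the standard code; the two differ in permitted start codons), and the names `translate`, `gc_percent`, and `clean` are hypothetical helpers.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI's TCAG codon ordering (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First six codons of the gene sequence above.
print(translate("ATGAGCGCGG AGAGGAGG"))  # MSAERR
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the Protein sequence and GC content fields of this record.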