Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1529 |
Symbol | |
ID | 4600666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1475620 |
End bp | 1476804 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639774303 |
Product | major facilitator transporter |
Protein accession | YP_920928 |
Protein GI | 119720433 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.324526 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCGCGTA AGGCTGAAAC TCAGAGCGGA GAGAGAGCGG AGCCCCGCGC CTCGACGGGC GGTGGAAGAG CGGAGAAAGT GCTCGTAGCG TACGGGCTTG CGAGTAGCTT CGCCGGTAAC CTCGTATCGC CGTTCATGGC TCTCTACCTC TACGGGCTCG CCGGCGGGAG GTTTTTCCAG GCAGGTATTG CCAGCCAGGC CCCGGTGGCG GTCTCAGTGA TCATGGGGCT CTTCTGGGCC AGGCTGTCTG ATACTAATGG GTCGAGGAAG AAGTTCGTTC AGCTGAGCTT GGCTACGGGC TCCCTGGCTT CCCTGGCTCT CAGCTTCGCG TCAAGCATTA ACGATGCGAT ACTCGTCCAG ATACTCGGGG CGTTCACCGG GAGCGCTGGG GGAGCGGCGT TCTCCGCCTT GATGGCGGAG GTCTTCAAGG ACAAGAGGGG GTCGAGGCTC GGGGTTTACA ACGCGTCCAC GGTCATCGGC GGCTTCGCCG GTAGCATGCT TTCAGGGTTC CTGTACAACT GGATAGGCTT CAGGTGGATG CTCAGGCTGA ACGCCTTGCT CGGGGTCTTG CCCCTAGTCC TGATAAGCAT GATCCCCGAA GAGGGTAACC GGAACCCCGT TAGCTCGAGG AAGCTTGTAA GCATCCCGAA GATACCCCGG AGGTTCTGGA AGCTCTACCT AGCTAGGCTC GTGCTCAGCC TCCCCGGGGC TCTGAGCGGA GGAGTGTTCT CGGTGTACTT CGTGAAGTAC CTTAACGGCC CGCCTGAAGC TTGGTCGACG CTCGTGGCAG TCACAACTCT CTTCGGGCTA GCGTCCATCC CTTACGGCAG GCTCGCCGAC AGGCTCTCGA CGCGGGAGAT GTTCGTGCTC GCAGGGCTCG GGTGGACCGC TCTCTACCTG GGGTACTTTC TGTCCCCGAA CTACCTCGTC TTCTCCCTGT TCTTCGTGAT ACCGGTGTGG CCGGCGTTCT GGCTCGCGTA CTCCAAGGCC TTGATGGACC TGAGCGACGA GTCGGAGAGA GCTACGTTCT ACGCCTTCGA GGGGACGCTC TCGGCGCTAT TCGGGTCCGC GGTCGGCGTA GCCGCGGGGC TCGTCGCGGA CCTCTACAGC CCGAGAACCC TCTTCCTGCT GTCCTCCGCC TCCGCGTTGC TGGGCGCCGC CGTAGCAGGC GTACTGCTCA GGTAG
|
Protein sequence | MSRKAETQSG ERAEPRASTG GGRAEKVLVA YGLASSFAGN LVSPFMALYL YGLAGGRFFQ AGIASQAPVA VSVIMGLFWA RLSDTNGSRK KFVQLSLATG SLASLALSFA SSINDAILVQ ILGAFTGSAG GAAFSALMAE VFKDKRGSRL GVYNASTVIG GFAGSMLSGF LYNWIGFRWM LRLNALLGVL PLVLISMIPE EGNRNPVSSR KLVSIPKIPR RFWKLYLARL VLSLPGALSG GVFSVYFVKY LNGPPEAWST LVAVTTLFGL ASIPYGRLAD RLSTREMFVL AGLGWTALYL GYFLSPNYLV FSLFFVIPVW PAFWLAYSKA LMDLSDESER ATFYAFEGTL SALFGSAVGV AAGLVADLYS PRTLFLLSSA SALLGAAVAG VLLR
|
| |