Gene Information |
Locus tag | Tpen_0004 |
Symbol | |
ID | 4601889 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 3025 |
End bp | 4311 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639772757 |
Product | major facilitator transporter |
Protein accession | YP_919417 |
Protein GI | 119718922 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
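The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch reproduces. It assumes the start and end positions are 1-based and inclusive (the reported gene length is consistent with that) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3025
end_bp = 4311

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1287 428
```

Both results match the Gene Length (1287 bp) and Protein Length (428 aa) fields.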
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.44219 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGAGG GAAGACCCAG GGTTAAAGTC ATAGTAGCCT CCCTAGCGCT CTTCGTGCTC TTCTACTTCA TGGGGTACTA CATGCTTAAC CCGATACTGC GAACGTTGCA CGCAGAGGGG CTCATTCCCG GGAAAAGCGA GGTCGAGTGG CGCTTCAACG CCGGGCTTAT AGCCACGCTA CTTCAAGGAA CGGGCCTCGT CCTCTCGTTC GTGTGGGGAG TGCTCGCCGA CAAGGTTGGT AGGCGCCCAG TGATTTTCAC GCTTGGAGTA ATAATGGGGT TAGGGCTACT CCTGGTGTCG CAGGCTAGGA GCTACGCCGA GCTACTCGCC TACTTCATCG CGTTCGGCGT GGGATACGTG GGCGTGGGAC CAGCGATATA CGCCTTTATC TCCGACGCCC TGCCCAGCGA GAGCAGGGGG AGAGGTTACG CGTCCTACTA CGTCTCAAGC GTGCTCGCCA TGATCCTAGG GCTGATAGTC GCCGGCGTAC TCCTGCCGTG GCGCACAGCC TACATGCTTG CAGGACTACT CACACTGGTG TTCTCCGTCC TCCTGTTCTA CTCTTCGAGG GGCATATTCA TAGGGTACTC GGAGAAAGGC GCTAGAGAGG CTAGGAGGTA CTCGCTCAGA GAATCCCTGC CGAGCCTGAG GAAGAAAAGC GTTCTGCTCG TACTCCTAAT GATCATACCC TGGACCATCC CTTGGGGTAT GCTGAGCATA TGGTCCATAG ACTACATTAG CACGAAGTGG GGCGTCTCGA CTGGTACAGC GTCGCTGATA ATAGCGGCGG CGACTGCTTC GATAGCTCTT GGACACATAG TGGGCGGCAC CCTGAGCGAT AGGCTCGCCG GCAAGGGGGA CTACACCGGG AGAACAAAGG TGTCGCTACT GGGAGTAGTA GTCGGGTACG TGTCGATGAT GCTCATGGTA ACATACCCCT ACCCCTACGG CAGTACGAAC TTTAAAGACC TCCTTGTACC CTCAGCGCTA GCAGTTGGCG GAATGATGTT CACCACGTTT GCGTACCCGA ATATAAACAC CGTTCTAAGC GAGGTGGTAG TCCCGGAGCA TAGAGGCACG GTGTTCGCTG TTTACAGCGT TCTAAACAAC CTGGGCTGGA CCCTGGGCCC AACGGTCTAC ACTCTTCTCC TCAAGGCATT CAGCGGCGTC TACGCAGACC AAGTATCGGC GATGACCGCG GCAGCGTCGA CGATAGTCTC CCTGTGGCTC ATACCTGCAC TTTGCTGGCT ACTGCTCCAC AGGGTCTACC CGAAGGAGAA GATATAG
|
Protein sequence | MPEGRPRVKV IVASLALFVL FYFMGYYMLN PILRTLHAEG LIPGKSEVEW RFNAGLIATL LQGTGLVLSF VWGVLADKVG RRPVIFTLGV IMGLGLLLVS QARSYAELLA YFIAFGVGYV GVGPAIYAFI SDALPSESRG RGYASYYVSS VLAMILGLIV AGVLLPWRTA YMLAGLLTLV FSVLLFYSSR GIFIGYSEKG AREARRYSLR ESLPSLRKKS VLLVLLMIIP WTIPWGMLSI WSIDYISTKW GVSTGTASLI IAAATASIAL GHIVGGTLSD RLAGKGDYTG RTKVSLLGVV VGYVSMMLMV TYPYPYGSTN FKDLLVPSAL AVGGMMFTTF AYPNINTVLS EVVVPEHRGT VFAVYSVLNN LGWTLGPTVY TLLLKAFSGV YADQVSAMTA AASTIVSLWL IPALCWLLLH RVYPKEKI
|
| |
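The gene and protein sequences above can be cross-checked against each other. The sketch below is a minimal, dependency-free Python example that translates the first and last blocks of the gene sequence and compares them with the ends of the protein sequence. It assumes that the amino acid assignments of translation table 11 are the same as those of the standard genetic code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and are not part of the record or of any library.

```python
# Codon table laid out in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that split the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 27 bases of the gene sequence, copied from the record.
first_blocks = "ATGCCGGAGG GAAGACCCAG GGTTAAAGTC"
last_blocks = "AGGGTCTACC CGAAGGAGAA GATATAG"

print(translate(first_blocks))  # MPEGRPRVKV
print(translate(last_blocks))   # RVYPKEKI
```

The two outputs match the first and last blocks of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.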