Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1831 |
Symbol | |
ID | 4602068 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1773655 |
End bp | 1774947 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639774604 |
Product | major facilitator transporter |
Protein accession | YP_921229 |
Protein GI | 119720734 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2211] Na+/melibiose symporter and related transporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.612991 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTAACGA GAAAGCAGAG AGTAGCGTAC GGCTTTGCAC GTTTTGGATC CACAATATTC ATGGGGCTCT ACGACTTTGC GAGCTTCTAC ATCTACTGGC AGGTCTTCCG GCTCGACCCC TTGCTGTCCG GCTACATGGG CGCGATAGGT AAGATAACCA TTATGGTTGC AAGCCTGCTG GTAGGGTATC TCAGCGACTC CATATGGACC CGGTGGGGGC GGCGTAAGCC CTTCATAGCT ACTGGGGCGC CCCTCCTGGC TTTCTCAGGG TTCCTTCACT TCACCCCTAT ATATTTCCTT CAGGGAGCTA CACAGGAAGC CTTGTTCCTG TGGGGCGCCG CTACAAGCTC CATGTTCCAC TTCTTCTACG CGTGGCTTTT GACACCCTTC CAGGCGTGGC TACCGGAGAT ATCCGAGCCC TCCGAGAGGA TAGACATTTC GATGCTACAG AACGTGGCTA ATATACTCGG GAACATCGTC AGCACGGTTA CGGGGTTCCT CGCAAAGATG CTGGTAGCAC GTGGCCTTCT AATGCCGCTA GTAGGTGCGT ACGCTTTGGT TCTCGTAGCG CTCTTTACGC CACCGGTAGT CCTGCTACCT GTTGAGAGGA GAGCGGAGCC CACGAGACCG TCCCTCAAAG ACCTTCTAAC CGTGTTCAGG TACAGGGAGT ACATGAAGTG GATGGCTGTT CGAGGACTGA TGTCCTCCAG CGTTCAAATG CTTACAATAA CGATAATCGC GTACATAGAG AAGGTGATCG GGGTAGAACA CTCGCTTGCC TCGGCATCGT TCGGCGTTAT ACTCGTCGTC TTCGTAGCCG GCGCGTTCCC GCTATGGGGT AGGCTGGGCA AGAGGGCGGG AAAGGGGAGG GCGCTCACGC TTTCAATGAC GTTGCTGGGA GCTACGCTAC TCATGACCCC GCTCCCCCAC TTCGTCGACC CCGGTGTTTT CAGGGTGCTC CTAGGCTACG TGCTGGTAGC CCTCGGCGCT GTCGCCGTCT CAGCGTACGT CCTATTCCCC TACGCAGTGC TAGCTGACCT CGCGCACTGG TACGAGGTGC AGACCGGGGA AAAGCGGGCG GGTCTCTTCA CGGGGTTCGA GGGGATACCT ATCAACGTGT TCGAGTCGCT GGCCTACCTA GTCACGGGGT TCCTAATGAG CTTGCCGCAG GTGCCTGGCT CCGAGTACAC CTACGGGCTG ATCCTGTGGG GACCCGTAGC TTCGCTGTTC ACACTGGTAG CTCTCGCCAT CCTAAGGAGA ACGAACGTAG ACCCCTTCCT CGCCAAGAAG TAG
|
Protein sequence | MLTRKQRVAY GFARFGSTIF MGLYDFASFY IYWQVFRLDP LLSGYMGAIG KITIMVASLL VGYLSDSIWT RWGRRKPFIA TGAPLLAFSG FLHFTPIYFL QGATQEALFL WGAATSSMFH FFYAWLLTPF QAWLPEISEP SERIDISMLQ NVANILGNIV STVTGFLAKM LVARGLLMPL VGAYALVLVA LFTPPVVLLP VERRAEPTRP SLKDLLTVFR YREYMKWMAV RGLMSSSVQM LTITIIAYIE KVIGVEHSLA SASFGVILVV FVAGAFPLWG RLGKRAGKGR ALTLSMTLLG ATLLMTPLPH FVDPGVFRVL LGYVLVALGA VAVSAYVLFP YAVLADLAHW YEVQTGEKRA GLFTGFEGIP INVFESLAYL VTGFLMSLPQ VPGSEYTYGL ILWGPVASLF TLVALAILRR TNVDPFLAKK
|
| |