Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1303 |
Symbol | |
ID | 4601597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1245159 |
End bp | 1246346 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639774079 |
Product | major facilitator transporter |
Protein accession | YP_920704 |
Protein GI | 119720209 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0834796 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAAAG GGTCTCTTTC CTTCTACATC TCGCTGTACA TAGTAGTCTT TATAACAATG ATAGACCACT CGACGATATC TCCTATAGTG GCGCAGTACG CGAAGTCTCT GGGAGCCTCG GACTCCCTAG CAGGTATCAT CGTGGCGTCG TACTCTATGT CTGCCCTAAT CTCTCTACCC GTAGTGGGCC TACTGCTAGA CAAGGTCTCG AGGGCAGGCG TGGTCCAGTC TCTCATACTC GCAGACCTGG CTGTAGTATA CCTCTACACC CTCCCAAGAA GCCCCACGGA GCTACTCGTG GTGAGGGCTC TGCACGGGGC GGTAGACTCA GCCCTCTTTC CAGGTATACT GGCTGTATTT AGAGACATGG TCACTAGGAG ACTGGAGACA GGGTTCACGC TATACTGGGT CGTGACGGGG ACGCCGATAG CTATCGGGAG CCTGCTCGCG AGGACAGTAG TCCTGCACTA CGGCTTCCGC GGGGTCTTCT ACGCATTGAT ACCACTCTAC GTTGCAGGCT TGATCGCCTC TATAGAGATC TCTAGGAGGT ACAGGGAGGC CTTGGCCAGG AGAGCACTGT GGGAGGGGCC GAGGGATCCC GTGGCCCTCG GAACGCTGGT AGCCGCGTAC GTCTCTGCAT ACATCCTCTA CACGGGCATA GGGACCGTGG TGGGTAGTAT GTCAATCTCT CTGACTAGAG GCCTGGGGCT CCCCAGGGAG GCCGCGGCCG CGGAGGTTGC TTGGTGGGCC TTTCAGGCCA CCCTCTTCTC TCTCTTAGTC ATGGCCTTCA CCGCAAGACA CGTTGTCAGA GAGGTCTCAA AAGCGCTGTG GGCCCTGGCG ATAGGCCTGT CTGGCGTAGC TCTCTCCATG GGACTGCTTC TAGTGAGCAT AGAGAGCCTC TCGAGGACCC TCTCGTCCCT ATTCTTCGGG GTAGCTCTCG GCACGGTACT CCCGGCCTCC TCGAAGATCG TGTCCGACAC CCACTATAAG TACAGGGGGA GGGCCTCTGC ACTACTATCG GTATCCTTCC TCACAGGCGT CATAACAGGG GCTGTGGCCT CCTCGAGGCT CCTCGAGACG GCACACGGCC TCTACACGAA CTACCTGCCC GCCGCGGCAG GCGCCTCTCT AGGACTCCTG CTCGTGCTCT ACGTAGCCTC TAGGAGACGG GCCTCCCGCA GTACCTAG
|
Protein sequence | MGKGSLSFYI SLYIVVFITM IDHSTISPIV AQYAKSLGAS DSLAGIIVAS YSMSALISLP VVGLLLDKVS RAGVVQSLIL ADLAVVYLYT LPRSPTELLV VRALHGAVDS ALFPGILAVF RDMVTRRLET GFTLYWVVTG TPIAIGSLLA RTVVLHYGFR GVFYALIPLY VAGLIASIEI SRRYREALAR RALWEGPRDP VALGTLVAAY VSAYILYTGI GTVVGSMSIS LTRGLGLPRE AAAAEVAWWA FQATLFSLLV MAFTARHVVR EVSKALWALA IGLSGVALSM GLLLVSIESL SRTLSSLFFG VALGTVLPAS SKIVSDTHYK YRGRASALLS VSFLTGVITG AVASSRLLET AHGLYTNYLP AAAGASLGLL LVLYVASRRR ASRST
|
| |