Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1599 |
Symbol | |
ID | 4601530 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1548268 |
End bp | 1549533 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 639774372 |
Product | major facilitator transporter |
Protein accession | YP_920997 |
Protein GI | 119720502 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2211] Na+/melibiose symporter and related transporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGAGGT TCAGCTACGG GAAGATATTC CTCCTGGGGT TCGGGTTCTT CGGGATAAGC ATCCTGTGGT CTATCTACAA CTCCTACGTC CCGATATTCC TGAAAGAGTT CGGGCTCGCA TCGTGGCTCG TAGGGTTCAT AATGACTATC GACAACATAT TCGCGGTCGT GCTTCTCCCC TACATAGGCG TGCTGAGCGA CGTAACTAGG ACGAGGATAG GCAGAAGGAA GCCCTACATA ATCCTCGGGG CCCCTCCAGC CGCGCTGACA TTCGCGCTGA TACCGCTACT CAGAGGAGAC TTCTACGCGA TGCTAGCCGT TATAGTCGTG ATGAACTTTT CGATGGCGTT GTTCAGGTCG CCTGTAATAG CGTTCATGCC CGACATAACC CCCTCGGAGA AGAGAAGCCA GGCGAACGGC ATAATAAACT TCATGGGCGG CGTTGGATCC CTCCTAGCGT TCTTCGTGGG CGCAAAGCTC TACGAAATGA ACCCCTCCTA CCCCTTCGTA GCCGCGGCAG TCACGATGCT CCTGGCATCG CTACTCGTTG TCCTACTCGT AGACGAGCCC GAGGAGTTCA AGGCGAGGGG AGGGTCCGTC AGGCTGGGCG AGCTCCTGCG GGAGTCGTTT AGGAAGAGCT TCTCAGAGCT CTCGGCGAAC CTCAGGGAGG CGTTCCTGGG CGAGGATAAA AGCCTCCTCT TCATGCTAGC CTCGATCTTC CTGTGGTTCA TAGGCTACAA CGCGATAGAG ACTTTCTTCA CCAGCTACGC GAAGTGGTAC CTGGGGATCG GGGAGGCGGC GGGCTCCCTG ATCCTGGGCT TCGTGGCGCT CGGGTTCCTC GTGTTCTCCC TACCCGCGGG CTTTATCGGG GCGAGGCTCG GCAGGAGGAA GACCATGACG CTCGGGCTCG CCCTGCTGGT AGTCCTGCTC GGCTTAGCGT TCTACGCCTC GACAGCCGTG AAGACAGGGG CGGTGATCTA CGTTCTAGGC GCCATATTCT TCTTCGGAGG ATTCGCTTGG GCCCTCGTCA ACGTCAACTC CCTTCCAACC GTGGTGGACA TGACCAGTAG GGAGAGGCTC GGAGCATACA CGGGGCTCTA CTACTTCGCG TCCCAGAGCG CCGCTATAAC CGCCCCGCCG CTGGCAGGCC TCTTCATAGA CGTGCTCGGC TACCAGGCGC TATTCCCCTA CTCGATAGTC TTCCTCCTAG CGTCCGCGGT AACACTACAG TTCGTTAAGC GCGGGGAAGC CAGGAAAGGG TTCTAA
|
Protein sequence | MERFSYGKIF LLGFGFFGIS ILWSIYNSYV PIFLKEFGLA SWLVGFIMTI DNIFAVVLLP YIGVLSDVTR TRIGRRKPYI ILGAPPAALT FALIPLLRGD FYAMLAVIVV MNFSMALFRS PVIAFMPDIT PSEKRSQANG IINFMGGVGS LLAFFVGAKL YEMNPSYPFV AAAVTMLLAS LLVVLLVDEP EEFKARGGSV RLGELLRESF RKSFSELSAN LREAFLGEDK SLLFMLASIF LWFIGYNAIE TFFTSYAKWY LGIGEAAGSL ILGFVALGFL VFSLPAGFIG ARLGRRKTMT LGLALLVVLL GLAFYASTAV KTGAVIYVLG AIFFFGGFAW ALVNVNSLPT VVDMTSRERL GAYTGLYYFA SQSAAITAPP LAGLFIDVLG YQALFPYSIV FLLASAVTLQ FVKRGEARKG F
|
| |