Gene Information |
Locus tag | Tpen_0852 |
Symbol | |
ID | 4601977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 801452 |
End bp | 803299 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 639773630 |
Product | major facilitator transporter |
Protein accession | YP_920256 |
Protein GI | 119719761 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1392] Phosphate transport regulator (distant homolog of PhoU) |
TIGRFAM ID | [TIGR00153] conserved hypothetical protein TIGR00153 |
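The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes one terminal stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 801452
END_BP = 803299
GENE_LENGTH_BP = 1848
PROTEIN_LENGTH_AA = 615


def span_length(start: int, end: int) -> int:
    """Length of a closed interval (assumption: 1-based, inclusive ends)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS (assumption: length includes one stop codon)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed values: 803299 - 801452 + 1 = 1848 bp, and 1848 / 3 - 1 = 615 aa.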
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGGTCCC TGAGACGCGA GAGCATCGCG ATGGCTCTAG TACTCTTAAC TGAGATACTC GTCGGGCTAG CTACAGGCGT GCAGCGAACC ATACTTGGTG TGGCTTCGCA CGCAGCTGGC GGGTCTTTCC TTCTGCCCAT AGTCTCGTTC GGCGCCTTCA AGGCTACGTT CGACCTGTTC ACGGGGCTGT ACGCGGGGAA GAGTAGGCGT AAGTCCCTGC TGACGGGAAC GCTGGTATAC ACTACGGGTG CGGTAGCCCT ATTGCTACTC CCTCCCCCGC TTAACTTCCT GGTAGGCAAC ATCTTCGTGG GAGCCGGCGA GGGGCTCGTC TTCGCTACCA GTGCGCTCGC AATCCGGGAC ATTCTGGGGC TCGAGCGTTC ATCGCTCAGC TTCGGATACA TCGAGAGCGC GTGCTACTTC GGTTACTCTA TCGGAGCGTT CGTTAGCGGG CTTGTCTACG GCTCCCTAGG GGCACCTGCG ACGCTGTTCG TGATTCTCGC TTCCTCTGTT CTAGGCGTGG TCTCGGCTGC CAGCTCGCAT GAAACTATGC AGTACACGCT GCAGGAGCGG GAGCGCTTCT CGACGACGAT GAAGACGTCG GAGATCGTGA AGCTACTCTT CTCGAACCCT AGCACGGCGT CAGCCCTTCT CGCGGCGCAT ATGGCGAAGG TAGCTGATAG CATTGCATGG GGTGTTATCC CGGTCTACAT GGTCGCCAAG GGTCTCCAGG TGTATCACGT AGGCTTCGCG CAGTCCCTGT TGCTACTCGT GTGGTCCTCT ACTATGCCCT TCTGGAGTTC GTTCTCGGAT AGGGTCGGCA GGCGCGCGCT GGCAACCCTT GGGTTGATGA TCAACGGCGC CCTCTTGATA GCCTTGCCGG GCACGCGTAA CTTCCCAGAG ATGCTGCTCA TAGTCCTGGT CATGGGTTTA AGCTACGCTA TGTACTACCC GATACTGCCT GCACCCGTGG CAGACATGAC GCCCCCGGAA GGGCGGGACC TAGCGGTAGG GGTTTACCGC GCGTTAAGGG ATTCGGGCTA CGCCACTGGA GCGCTTATCG CCACGCTTAT ACTCTCGGTT GCGCCCAGCT CCCTGGATAG CGTCTTCATA GATATCGGGA GTATGCTGGT AGTAACGGCA GCAGCCTTCT CCATCGTCTT CAGGGAAACG AGACCTACGT GGCCCTTCCT TAACCTCGTC ATAAGGCACG TTGAGATAAT AAGGGACGTG CTTGTGTACC AGCAGAAACT CGTGGAGAAA GCTTTCGGCG GCTACGCAGA GGAGTTGGAG TCGGGGATAC GCGTGTTAAA GGATATGGAG AGGAAGGCAG ACGCCGTGAA GAGGGAAGTC ACCTGGAGGA TTTACTCGGG GTTGCTACCC ACATCCAGCA GAATAGACTT CGAGAGGCTC GTCGAGGAAA TCGACAAGGT CGCCGGCGCG GTTATAGAGT GCAACGAGAG GCTTCTATGG GTGAAGCACA GCGAAAAACT CCGGGACTTG AAGCAACTCC TGCTGGAAAT GTTGAACGAG AACATCAGGC TGGCGGACAT GCTCATAGAA AACCTGCGCG TGCTCAGCCT ATCCCCACTC TACGCGGTGC GCGCTTCGAT CGAGATAGAC GCGGGAGAGA GGAGGGTCGA CGAGTTAAGG ATAAAGGCAA TACACATGAT TAGAAAGCTC TTGGACGAAA ACGAGATCGA CATCATGTCG GCGCTGAGCC TCATGGAAGC TGTAAACCTG CTAGAGCTAA CGAGCGACGA CTTCCAGGAC GCCGCCGACA TCATCAGGAT AATCAGCTAC 
CGGCACGCCG CCCTACCTCC CGATAGAATC GCGCGGTTCG GCGCCTAG
|
Protein sequence | MGSLRRESIA MALVLLTEIL VGLATGVQRT ILGVASHAAG GSFLLPIVSF GAFKATFDLF TGLYAGKSRR KSLLTGTLVY TTGAVALLLL PPPLNFLVGN IFVGAGEGLV FATSALAIRD ILGLERSSLS FGYIESACYF GYSIGAFVSG LVYGSLGAPA TLFVILASSV LGVVSAASSH ETMQYTLQER ERFSTTMKTS EIVKLLFSNP STASALLAAH MAKVADSIAW GVIPVYMVAK GLQVYHVGFA QSLLLLVWSS TMPFWSSFSD RVGRRALATL GLMINGALLI ALPGTRNFPE MLLIVLVMGL SYAMYYPILP APVADMTPPE GRDLAVGVYR ALRDSGYATG ALIATLILSV APSSLDSVFI DIGSMLVVTA AAFSIVFRET RPTWPFLNLV IRHVEIIRDV LVYQQKLVEK AFGGYAEELE SGIRVLKDME RKADAVKREV TWRIYSGLLP TSSRIDFERL VEEIDKVAGA VIECNERLLW VKHSEKLRDL KQLLLEMLNE NIRLADMLIE NLRVLSLSPL YAVRASIEID AGERRVDELR IKAIHMIRKL LDENEIDIMS ALSLMEAVNL LELTSDDFQD AADIIRIISY RHAALPPDRI ARFGA
|
| |
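The relationship between the gene sequence and the protein sequence can be sketched as a plain codon-by-codon translation. This is a minimal illustration, not the database's own method. It assumes the gene sequence is given in coding orientation (it begins with GTG and ends with TAG), that translation table 11 uses the standard amino acid assignments, and that an alternative start codon such as GTG is read as methionine. The `START_CODONS` set is a deliberately small, assumed subset, and the `translate` helper is hypothetical.

```python
# Standard amino acid assignments, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a small subset of the start codons accepted under table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str, is_cds_start: bool = True) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the record
    can be pasted in directly.
    """
    dna = "".join(seq.split()).upper()
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases and last 18 bases of the gene sequence in the record.
    print(translate("GTGGGGTCCC TGAGACGCGA GAGCATCGCG"))
    print(translate("GCGCGGTTCG GCGCCTAG", is_cds_start=False))
```

The two fragments translate to MGSLRRESIA and ARFGA, which match the first ten and last five residues of the listed protein sequence.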