Gene Tpen_0852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0852 
Symbol 
ID4601977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp801452 
End bp803299 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content57% 
IMG OID639773630 
Productmajor facilitator transporter 
Protein accessionYP_920256 
Protein GI119719761 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1392] Phosphate transport regulator (distant homolog of PhoU) 
TIGRFAM ID[TIGR00153] conserved hypothetical protein TIGR00153 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGGTCCC TGAGACGCGA GAGCATCGCG ATGGCTCTAG TACTCTTAAC TGAGATACTC 
GTCGGGCTAG CTACAGGCGT GCAGCGAACC ATACTTGGTG TGGCTTCGCA CGCAGCTGGC
GGGTCTTTCC TTCTGCCCAT AGTCTCGTTC GGCGCCTTCA AGGCTACGTT CGACCTGTTC
ACGGGGCTGT ACGCGGGGAA GAGTAGGCGT AAGTCCCTGC TGACGGGAAC GCTGGTATAC
ACTACGGGTG CGGTAGCCCT ATTGCTACTC CCTCCCCCGC TTAACTTCCT GGTAGGCAAC
ATCTTCGTGG GAGCCGGCGA GGGGCTCGTC TTCGCTACCA GTGCGCTCGC AATCCGGGAC
ATTCTGGGGC TCGAGCGTTC ATCGCTCAGC TTCGGATACA TCGAGAGCGC GTGCTACTTC
GGTTACTCTA TCGGAGCGTT CGTTAGCGGG CTTGTCTACG GCTCCCTAGG GGCACCTGCG
ACGCTGTTCG TGATTCTCGC TTCCTCTGTT CTAGGCGTGG TCTCGGCTGC CAGCTCGCAT
GAAACTATGC AGTACACGCT GCAGGAGCGG GAGCGCTTCT CGACGACGAT GAAGACGTCG
GAGATCGTGA AGCTACTCTT CTCGAACCCT AGCACGGCGT CAGCCCTTCT CGCGGCGCAT
ATGGCGAAGG TAGCTGATAG CATTGCATGG GGTGTTATCC CGGTCTACAT GGTCGCCAAG
GGTCTCCAGG TGTATCACGT AGGCTTCGCG CAGTCCCTGT TGCTACTCGT GTGGTCCTCT
ACTATGCCCT TCTGGAGTTC GTTCTCGGAT AGGGTCGGCA GGCGCGCGCT GGCAACCCTT
GGGTTGATGA TCAACGGCGC CCTCTTGATA GCCTTGCCGG GCACGCGTAA CTTCCCAGAG
ATGCTGCTCA TAGTCCTGGT CATGGGTTTA AGCTACGCTA TGTACTACCC GATACTGCCT
GCACCCGTGG CAGACATGAC GCCCCCGGAA GGGCGGGACC TAGCGGTAGG GGTTTACCGC
GCGTTAAGGG ATTCGGGCTA CGCCACTGGA GCGCTTATCG CCACGCTTAT ACTCTCGGTT
GCGCCCAGCT CCCTGGATAG CGTCTTCATA GATATCGGGA GTATGCTGGT AGTAACGGCA
GCAGCCTTCT CCATCGTCTT CAGGGAAACG AGACCTACGT GGCCCTTCCT TAACCTCGTC
ATAAGGCACG TTGAGATAAT AAGGGACGTG CTTGTGTACC AGCAGAAACT CGTGGAGAAA
GCTTTCGGCG GCTACGCAGA GGAGTTGGAG TCGGGGATAC GCGTGTTAAA GGATATGGAG
AGGAAGGCAG ACGCCGTGAA GAGGGAAGTC ACCTGGAGGA TTTACTCGGG GTTGCTACCC
ACATCCAGCA GAATAGACTT CGAGAGGCTC GTCGAGGAAA TCGACAAGGT CGCCGGCGCG
GTTATAGAGT GCAACGAGAG GCTTCTATGG GTGAAGCACA GCGAAAAACT CCGGGACTTG
AAGCAACTCC TGCTGGAAAT GTTGAACGAG AACATCAGGC TGGCGGACAT GCTCATAGAA
AACCTGCGCG TGCTCAGCCT ATCCCCACTC TACGCGGTGC GCGCTTCGAT CGAGATAGAC
GCGGGAGAGA GGAGGGTCGA CGAGTTAAGG ATAAAGGCAA TACACATGAT TAGAAAGCTC
TTGGACGAAA ACGAGATCGA CATCATGTCG GCGCTGAGCC TCATGGAAGC TGTAAACCTG
CTAGAGCTAA CGAGCGACGA CTTCCAGGAC GCCGCCGACA TCATCAGGAT AATCAGCTAC
CGGCACGCCG CCCTACCTCC CGATAGAATC GCGCGGTTCG GCGCCTAG
 
Protein sequence
MGSLRRESIA MALVLLTEIL VGLATGVQRT ILGVASHAAG GSFLLPIVSF GAFKATFDLF 
TGLYAGKSRR KSLLTGTLVY TTGAVALLLL PPPLNFLVGN IFVGAGEGLV FATSALAIRD
ILGLERSSLS FGYIESACYF GYSIGAFVSG LVYGSLGAPA TLFVILASSV LGVVSAASSH
ETMQYTLQER ERFSTTMKTS EIVKLLFSNP STASALLAAH MAKVADSIAW GVIPVYMVAK
GLQVYHVGFA QSLLLLVWSS TMPFWSSFSD RVGRRALATL GLMINGALLI ALPGTRNFPE
MLLIVLVMGL SYAMYYPILP APVADMTPPE GRDLAVGVYR ALRDSGYATG ALIATLILSV
APSSLDSVFI DIGSMLVVTA AAFSIVFRET RPTWPFLNLV IRHVEIIRDV LVYQQKLVEK
AFGGYAEELE SGIRVLKDME RKADAVKREV TWRIYSGLLP TSSRIDFERL VEEIDKVAGA
VIECNERLLW VKHSEKLRDL KQLLLEMLNE NIRLADMLIE NLRVLSLSPL YAVRASIEID
AGERRVDELR IKAIHMIRKL LDENEIDIMS ALSLMEAVNL LELTSDDFQD AADIIRIISY
RHAALPPDRI ARFGA