Gene Tpen_1533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1533 
Symbol 
ID4600375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1480794 
End bp1482152 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content60% 
IMG OID639774307 
Productcitrate transporter 
Protein accessionYP_920932 
Protein GI119720437 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1055] Na+/H+ antiporter NhaD and related arsenite permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.232848 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCGCCT TGCGGGTGGC ATTCGTCTAC TCCGTGCACA TGAGAGAAAC GTTTTTCGCA 
ACGGGTTCTG AGTGTTGGAA GCGTATGTTC GACGCGAAGC CCCTGATAGG TGCCGGTGTA
CTGGTGTACC TAACGGTAGC CCTCGTGGCG CGTAGCAGGA GGCCTAAGAC TCCTGTGTGG
AGCATCATGG CGTTTGCCTC TTTCATAGTC GTAGCCACGG GGCTTCTCGG CATAGACGAC
GTGAGGAAGA GCGTAGACGT AGACGTGATA CTATTCCTCG TGGGTATGTT CAGCATAGTA
GGGCTCGCGG AGACGAGCGG GCTCCTCACT GCCGCCTCCT ACTTCTTCGT ATCCAGGTTT
CACAGCAGGG TGAAGCTCTT CTACGCGTCG GCCGTTCTCT TCGGCCTGCT CGCCGCGTTC
GCGGTAAACG ACACCGTTGC CCTCATGGGC CCCGCGGTGG CGTACGTGAT TTCGCGGGCG
GCCGGCATAG ACCCCAAGGC GATGTTCCTC CTCCTGGCCT TTTCGATAAC GATAGGGTCG
GCGATGACCC CCATAGGGAA CCCCCAGAAC GTGCTCATAG CCTCGGGCTC CGGGATGCCG
GCCCCAATGC TGGTATTCAC GGCTAGGCTG GCGGTACCCA CGCTCGTCAA CCTGCTCCTA
ACGGCCTACC TGCTCTCCAA GCTCTATGGG CTGAGGGACG CTAAGGTGCA GGTGGCGCTG
ATCCCGGAGG AAGCCATAAG GAACAGGAGG GATGCCGCCC TGGCGGCCGC CGGCCTCGCC
GGAACAGTTC TCGCGCTGGT GGTCAACGAC TTCCTCGAGC TCGCCGGGAT GCCGCACGTA
TCGGACAGGG GCATTATACC GTTCGTCGCT GCCGCCGCTA TCTACCCGTT CACCTCCAAC
CCGAGGAGGA TCCTCTCGAG GGTCGACTGG TCCACCGTAG TGTTCTTCAT AACCATGTTC
ATAACGGTCG CGGGGGTTAT GAGGAGCGGG GTCGTCGACC CCGCACTACG GCTCTTGCTC
CCCGAGAAGG CTACCGGAGC CCGGGATCTC TTCGCGATAG CCCTCCTCTC GCTGGCGCTG
AGCCAGTTCC TGAGTAACGT GCCGCTGGCA AGCATAATGG TGGAGTACAT GAGGGGGCTA
GGCTACTCGA GTACCGATGT CCGAGCCTGG CTAACGCTGG CAACAGCTTC AACCATCGCC
GGCAACCTTA CCCTGCTGGG CGCGGCTTCG AATATCATCA TTCTCGAGAT GCTCGAAAGG
CGCTTCAAGA CGACGATAAC ATTCACGGAG TTCCTCAGGG TAGGCGTGCT CGTAACTGCG
CTGAACATGC TCGTATACGC GCCGTTCCTA CTCTTGTAG
 
Protein sequence
MLALRVAFVY SVHMRETFFA TGSECWKRMF DAKPLIGAGV LVYLTVALVA RSRRPKTPVW 
SIMAFASFIV VATGLLGIDD VRKSVDVDVI LFLVGMFSIV GLAETSGLLT AASYFFVSRF
HSRVKLFYAS AVLFGLLAAF AVNDTVALMG PAVAYVISRA AGIDPKAMFL LLAFSITIGS
AMTPIGNPQN VLIASGSGMP APMLVFTARL AVPTLVNLLL TAYLLSKLYG LRDAKVQVAL
IPEEAIRNRR DAALAAAGLA GTVLALVVND FLELAGMPHV SDRGIIPFVA AAAIYPFTSN
PRRILSRVDW STVVFFITMF ITVAGVMRSG VVDPALRLLL PEKATGARDL FAIALLSLAL
SQFLSNVPLA SIMVEYMRGL GYSSTDVRAW LTLATASTIA GNLTLLGAAS NIIILEMLER
RFKTTITFTE FLRVGVLVTA LNMLVYAPFL LL