Gene Tpen_1541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1541 
Symbol 
ID4600834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1488979 
End bp1490625 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content53% 
IMG OID639774315 
ProductABC transporter related 
Protein accessionYP_920940 
Protein GI119720445 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR01166] cobalt transport protein ATP-binding subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.795149 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCGTTCAG CCGTGGAGTT CCCGGTGGTC TTACGGGTGC CTGAAATAGA GGTCGAGGAG 
CTTTCGTTTA CTTACGCCGG TAAAGGGGAG CCAGCGTTAA GGAACGTTAG CTTGGAAGTA
GACAAGGGCG AAGTGGTGCT TCTTGCCGGT AGGAGCGGGA GCGGGAAATC TACGCTTTTA
AAGGCCATCA ATGGGCTGAT ACCCCATAGG TACTCGGGAA CTTATAAGGG AGCCGTCAGG
GTAAGGGGGC TTACCGTAGC AGAAACTCCC GTCTACGAAT TGTCGAGAAT TGTAGGCACA
GTTATGCAGG AGGTCTCGAA GCAACTATTT CTGCCAACAG TAGCGGATGA CGTAGCGTTT
GGCCCCTCCA ACCTATGCAT CGAGAGAGAG GAAATTGAGA GGAGGGTTGA AGAGGCGCTG
AGTAGACTCG GCATACTGCA CCTGAAGGAT CGCGACGTGA ACGAGCTGAG TGGCGGCGAG
AAACAGAGAG TAGCCATAGC CGGCGTCTTA GCCATGAAGC CCGAAATAAT ACTCATGGAT
GAGCCTCTAG CCAATCTCGA TAGCGAAGGT GTAGCCACGG TTCAAGAGGT GATAAAGAGT
TTCCGCGAAG AGGGAAAAAC AGTCGTAATA GCGGAACACA GAGTTGAGGA AGTTCTGGAG
GTAGGGGTGG ACCGGGTCTT CGTGATGAGA GAGGGAACCA TAGTTAGGGA GGTGGAGAGA
GTAGAGGAGC TCGCCAACTT TGCGGATGAA CTGAAGGTTC CAGCAGAGGC TTACTTGCCC
CCGGGCATCG TGGAGGCTCC ACGCATCTCT CTGCCGCATC ACTCTCCGGG GGTAGTCGCC
GTAGAGTTTA GGAATGTGAG CTTTAGCTAC GAGGACAAGC CCGTACTCAA AGGGATAAAC
CTTGAGATAC GCGAGGGGGA GAGAGTCGCG CTTCTCGGAA ACAATGGCGC CGGTAAAAGC
ACAATGGCGA AGCTCATGCT CGGCTTGCTT AAACCTACGG AGGGTAGCGT CCTGGTCTAC
GGACGGGATA CCCACACGAT GGAAGTCTAT GAAGTCGCGC CGATGGTCGG GCTAGTGTTT
CAGGATCCTT TCAGCATGCT ATTCGCGGAG ACCGTGCGAG AAGAAATAGC ATTTGGGCCT
AGAAATCTGG GAGTTCCGCC CTCCGAGATA CCTGGGAGAG TGGAGGAGGC GGCGGAGAGG
TGCTTTGTGA AGCACTTGCT CGACTTCTCC CCCTTTGCGT CTAGCCATGG GGAGAAGAAG
AGAATATGCG TTGCCTCAAT ACTATCTATG AAGCCAAGGA TTCTGGTGCT GGACGAGCCT
ACAGCGGGGC AGGACTACGC TACATATACT GCTTTCATGG AGTTCGTAGA CTCCCTGGTA
AACGCCGGCG TCATTAAAAC ACTTGTACTA ATAACGCATG ACACCGATCT CGCAGTGGAA
TACACAGACA GAACAATAGT GCTTGCTAAC GGAGAGATAG TGGCAGACGG ACCAACTAGG
AAAGTGCTTT CAGACTCCTC TGTGCTTGCA AAGGGGAAGA TTCGTGAGAC AAGCCTTATT
AGGCTAGGCA AAAGGCTCAC CGGTGGAAAC TACATCTTGT CGAGAAAAGA ACTAGCTATG
CTAGCACGTA GCAGGACGCA TTCGTAG
 
Protein sequence
MRSAVEFPVV LRVPEIEVEE LSFTYAGKGE PALRNVSLEV DKGEVVLLAG RSGSGKSTLL 
KAINGLIPHR YSGTYKGAVR VRGLTVAETP VYELSRIVGT VMQEVSKQLF LPTVADDVAF
GPSNLCIERE EIERRVEEAL SRLGILHLKD RDVNELSGGE KQRVAIAGVL AMKPEIILMD
EPLANLDSEG VATVQEVIKS FREEGKTVVI AEHRVEEVLE VGVDRVFVMR EGTIVREVER
VEELANFADE LKVPAEAYLP PGIVEAPRIS LPHHSPGVVA VEFRNVSFSY EDKPVLKGIN
LEIREGERVA LLGNNGAGKS TMAKLMLGLL KPTEGSVLVY GRDTHTMEVY EVAPMVGLVF
QDPFSMLFAE TVREEIAFGP RNLGVPPSEI PGRVEEAAER CFVKHLLDFS PFASSHGEKK
RICVASILSM KPRILVLDEP TAGQDYATYT AFMEFVDSLV NAGVIKTLVL ITHDTDLAVE
YTDRTIVLAN GEIVADGPTR KVLSDSSVLA KGKIRETSLI RLGKRLTGGN YILSRKELAM
LARSRTHS