Gene Tpen_0341 details

Gene Information

Locus tag: Tpen_0341
Symbol:
ID: 4601451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 311567
End bp: 312985
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 54%
IMG OID: 639773101
Product: V-type ATP synthase subunit B
Protein accession: YP_919753
Protein GI: 119719258
COG category: [C] Energy production and conversion
COG ID: [COG1156] Archaeal/vacuolar-type H+-ATPase subunit B
TIGRFAM ID:
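
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(311567, 312985)
print(length_bp, protein_length(length_bp))
```

With the values in this record, the results agree with the listed Gene Length (1419 bp) and Protein Length (472 aa).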


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGGACG CCCAGATACT TGAGGAAGTA GTTCGAATGT ACCCCGGCAA GGTCTACAGA 
GGTGTGAAGG AGATTCGAGG AAGCCTCCTG ATAGTTGACG GCATCGAGGA AGCTGCCTAT
GACGAAGTCG TGAAGATATA CGGCAAGGAC TCCCGCGAGA GGTTTGGCCG TGTACTCGAG
ACGAGCATAG GGCAGGCAGT GGTACAGGTT CTGGGCGATA GGGAAGGGCT CGAAACGGAT
ACTCTCTTAA AGTTCACGGG GTCCACCTTT AAGATCAGAG TCTCGGAGGA CGTTATCGGA
AGAGTGTTCA ACGGTAGATT TGAACCGATA GATGGGTTAC CGCCCATACT TTCCGGAGAG
CTGAGGGAGA TAACCGGCGA ACCCATAAAC CCAATCTCGC GCGAGTATCC TCATGACTTT
ATACAGACAG GAGTAAGCGC CATTGACGGC TTATTCAGCA TGGTTAGAGG CCAGAAGCTA
CCAATTTTCA GCGTGTCCGG ACTCCCGCAC AACCTTCTGG CGGCACAGGT CGCCAGACAG
GCCACAGTGA GAGGTGAAGG CGAACAATTC GCGGTCGTTT TCGCGGGCAT AGGGTTGAGG
AAGACCGAGG CCGAGTTCTT TCTAGAGCAG TTCAGGGAGA CAGGTGCTAT TGAAAGGCTG
GTAGCGGTTC TCAACATGGC AGACGACCCG GCCGTAGAAA GGCTGATGAC TCCCAGAATA
GCGCTTACAG TAGCGGAATA CCTCGCTTTT GACCTCGACA TGCACGTCCT GGTAATAATG
TCCGACATGA CGAACTACTG CGAGGCTCTC CGAGAGGTTA GCTCGGCGAG GGGAGAAATA
CCGGGAAGGC TCGGGTACCC GGGCTACATG TACAGTGACC TAGCTACGAT TTACGAGAGG
GCCGGCGTCA TAAAGGGCAA AAAGGGTAGT ATAACCCTCT TTCCAATATT AACGATGCCG
GGCGGTGACC TCAGGCATCC CATACCTGAC CTCACCGGGT ACATAACCGA GGGGCAAATC
TTCCTGTCTC AGGAGATGTA TGCCCAGGGA ATCTACCCGC CTATCAATAT TCTGCCGAGC
CTAAGCCGTC TCATGAAGTC GGGCATAGGT CCCGGGAAGA CGCGAGAAGA TCACAGGTAT
CTCGCAGACC AGCTCTACGA TGCATACTCC AGGGGAGTCA AAGCGCGCGA CCTGGCGAGA
ATCATCGGCG AGATAGGGCT CAGCGAGCGG AATAGGAGGT TCTTGAAGTT CGCGGAGGAA
TTCGAGAACA AGTTCGTAAA CCAAGGATTC TACGAGAATA GGAGCATCGA GGAGACCCTC
GACCTCGGGT GGCAAGTGCT CTCCATACTC CCAGAGGAGG AGCTCGTGAG GATACCTCAG
AAAATCATCG AAAAGTACCA CCCGAAGTAC AGGTCGTGA
 
Protein sequence
MMDAQILEEV VRMYPGKVYR GVKEIRGSLL IVDGIEEAAY DEVVKIYGKD SRERFGRVLE 
TSIGQAVVQV LGDREGLETD TLLKFTGSTF KIRVSEDVIG RVFNGRFEPI DGLPPILSGE
LREITGEPIN PISREYPHDF IQTGVSAIDG LFSMVRGQKL PIFSVSGLPH NLLAAQVARQ
ATVRGEGEQF AVVFAGIGLR KTEAEFFLEQ FRETGAIERL VAVLNMADDP AVERLMTPRI
ALTVAEYLAF DLDMHVLVIM SDMTNYCEAL REVSSARGEI PGRLGYPGYM YSDLATIYER
AGVIKGKKGS ITLFPILTMP GGDLRHPIPD LTGYITEGQI FLSQEMYAQG IYPPINILPS
LSRLMKSGIG PGKTREDHRY LADQLYDAYS RGVKARDLAR IIGEIGLSER NRRFLKFAEE
FENKFVNQGF YENRSIEETL DLGWQVLSIL PEEELVRIPQ KIIEKYHPKY RS