Gene Tpen_1506 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1506 
Symbol 
ID4601207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1454265 
End bp1455890 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content61% 
IMG OID639774281 
Productcytochrome bd ubiquinol oxidase, subunit I 
Protein accessionYP_920906 
Protein GI119720411 
COG category[C] Energy production and conversion 
COG ID[COG1271] Cytochrome bd-type quinol oxidase, subunit 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0741111 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACCTTT TAGAGCAGTG GTTGAGCTAC CCTGCTGCGA TCGACAGAAT TCTATCGATA 
ATAGGCATAG AGATCCACTG GTTCATCCTC CAGTACGTAC TCGGGCTACC GCTGGTAATC
CTCGTCGCCC TCTTAGCCTA CAAGAGGACT GCTAACGAGA GCTGGCTCAA GCTCGCCAGG
ACATCGGCGA AGGCCCTCGG ACTAGTGTTC GCCGTGGGAG CAGCTAGCGG TACGGCTTCG
GAGTTCGGGC TCCTCGTGAT ATGGCCGAAC CTTTTGGAGG CCGCCGGGAG GTACATCTAC
TTCCCGCTCT ACGTAGAGAT ATTCGCCTTC CTAACGGAGA TAACCTTCAT CTACCTGCTG
GTCTTCGGCT GGGGCAAGCT ATCCCTTAAC GGCAGGATAG CGGTAGCCTT CCTGGCGCTT
CTCGGCGCCT GGCTGAGCGG AGCGATGATT ATGAGCGTGA ATAGCTACAT GGTGGCTCCG
ACGGGGGTCT CGCCAGCCTT CAGCCAGGAA GCCGGCTGGC TCTACTCGCA GGGGTACCCC
AAAATTCTCC TAGTAGTCCC GGAGCGGCTC GTAGACGCTC TGGACGTCGG GAAGCTCCAG
TCGCTGGGTA TGGAGGTCGT CGGGCGGGTA GGCGGGGGAG TCGCGGTGTA CATGCCCTCG
AGGATCGTAG CGCGCCTCGC CTACGAGGCG TGGGGAGGTA GAACCGTGGG GGAGAGCGTG
CTGGCACTAG TGGTGAAGCC CGAGGCTCTT CCAAGCCTTA AAGCCACGCT GGTAAAAGAC
GTCGTAGACG CTGTGCTCAC GGAGACTGTT AGAACCGTCG GCTACACTAC AGTGACTTTC
AAGTCCCCTG TCTTTGTTGG AAGCTTTCTG CACGCCGTCG GGGCGGCGCT CACAGTAACG
GGCTTCACCA TCGCGGGAGC CTACGCCCTC CTGTCCCTGC TCTCCAGAAG GGAGTCCAAG
TACTACGCGG ACGGCCTGCG CTTCGGGGTG GTATTCTCCC TGGTCGCCGT GGCCGTGCAA
GGCGCCGTCT TCGGGCACGT TATAGGGACG GAGATAGCGA AGTACAACCC GGAGAAGCTC
GCGGCGATGG AGGGTACGAG CAAGGCTGTG CTGAGCATCC CCAGGGCGCT CGGGATAGAG
TCCTTGATGA AGGCGATAAT ATTCGGGAAC CCCGGCGCCG CGATGCCTTC ATACGACGAG
ATACCGGTCG ACTACTGTAG GCTCGACAAC CTGCCGCCGG TACAGGACTG CAGGCCGCCC
TTGATCCTGC ATTACCTCTA CTACTCGAAG ATAGGGCTCT CGTTGCTCCT AGGGCTACTC
GCGCTAGCCG GGACGGTGCT TATCTACACG GGGAGGGCTC CGGGTAGGCT GACGCTGTAC
GGGTTTGCCG TGTCCCCCGT GATTGCCCAC GCCGTTTCCT TCCTCGGGTG GGCTGTCAGG
GAGATGGGCA GGAAGCCTTG GACGATATAC GGCGTCATGA CGGTCGACGT CGCGCACACG
GCTAATCCGG CCGGCGCCGC CGAGTACGCG GTCGTAGCCT CGATCTTCCT AGGGGTGCTC
GCCGCTCTCA TCTACGCGTC TTGGAGGATC CTCCTCGCGC CCTCCCTGAG GGGTGGTGGT
GAATGA
 
Protein sequence
MNLLEQWLSY PAAIDRILSI IGIEIHWFIL QYVLGLPLVI LVALLAYKRT ANESWLKLAR 
TSAKALGLVF AVGAASGTAS EFGLLVIWPN LLEAAGRYIY FPLYVEIFAF LTEITFIYLL
VFGWGKLSLN GRIAVAFLAL LGAWLSGAMI MSVNSYMVAP TGVSPAFSQE AGWLYSQGYP
KILLVVPERL VDALDVGKLQ SLGMEVVGRV GGGVAVYMPS RIVARLAYEA WGGRTVGESV
LALVVKPEAL PSLKATLVKD VVDAVLTETV RTVGYTTVTF KSPVFVGSFL HAVGAALTVT
GFTIAGAYAL LSLLSRRESK YYADGLRFGV VFSLVAVAVQ GAVFGHVIGT EIAKYNPEKL
AAMEGTSKAV LSIPRALGIE SLMKAIIFGN PGAAMPSYDE IPVDYCRLDN LPPVQDCRPP
LILHYLYYSK IGLSLLLGLL ALAGTVLIYT GRAPGRLTLY GFAVSPVIAH AVSFLGWAVR
EMGRKPWTIY GVMTVDVAHT ANPAGAAEYA VVASIFLGVL AALIYASWRI LLAPSLRGGG
E