Gene Tpen_0551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0551 
Symbol 
ID4600515 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp499337 
End bp500755 
Gene Length1419 bp 
Protein Length472 aa 
Translation table11 
GC content56% 
IMG OID639773322 
Producthypothetical protein 
Protein accessionYP_919960 
Protein GI119719465 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAG TAGTAATGAG GGGGGATGGA GCGGGGGAGG GGGTTGAAGA GTTCGACGAG 
TGGTTTATTC GCAGGAAGAT CGAGGAGCTC CTGAGGAAGG CGCTGTTTCC GCCCAGCGGC
TATAGGGGTC TAAAGGCTAC GCACAGCAAG GTTGCACGGG AGCTAGCAGA AATAAAGGCC
AACTATGTGC AATACGGCGA GGCCTACGTC GAGCACGTCG ACAGGCTCAG GGAAGGCGAG
GAGAAGGCCT ACAGAGAGGC GGAGAAAGCG ATGCAAGAAG CCTTAGAGCA CGCAGACGAA
CTCGAAATCA AGAGAACAGG GAAGACGACG TGGAAGGCTA AGCTCCCAGG CAGGAGATGG
AGGCTATACG TTTGCAGAAC ACCAACCGGG CACTGGCAGG TCGAGGTCGC CCTTCTCTTC
AAGGTCGCCG AGCTCAGGCT ATCCGATACA TTAAGGCTTC CGCCTGAGCT ACTCAGAGCG
GCCCAGGATG GCTGGATTCT TGGCGATGCG TCGTACATCG CAAACAAGAA AGAAGTCAAG
ATGGGTACAG CGCAGACGTG GCAGGTTGCC TCCTTTCCCG GTTTCTGGCC GGGAAAGGAG
GTAGTGATCT ACGTTAGAAG CGTAGTGATC CACGAGTCTC ACGTCAGCAT AATGTGGCAA
GTGAGAGTAC ACGGGGTTCG CGACGTGCCC CGATGGTGGA GGCTAAGGAA GGAGGAGAAG
CGGAGGATGG CAGTCGCCGA GATTGAGGAG GCAAACAAGG GCAATATAGA TGAGCGAAGA
GCCGTTCGGA TCGCAACGTA CTACGCCGCA GACGGAAAGT ATCCAGGATC AAACTCGGCC
CTCCATTATC TGGATTTCGC GGTTGGCCGA AGATCTCGCC GAGTTAGAAC GGAGCAATCC
GTCAGGGTTG CGAGGCTTCT CTACGAGAAA GTGCCGCAAC TATTAGCATT CATGGTCGCG
TCGGGTTGCA AGAAAGCAGA GTTCTTAGCG AGCCTGGCAT CCGTGAAGCC GCGACACTAC
GCGCCTCGCT ACCTGGAGGT GTGCGGTGTT AAAATGAACC TGCGGCTCGC AGGCCCTAAG
AACCGCCGCT ACCTTATGGC CCAAGTATAC ATCACGCGCA ATAACGAGGA GATGCTTCGC
GATTTCCCCG AGAGGGCGAG GCGCGAGGGT CTAGAAGTCA GAAGGGTGAA GGTGAGTAAG
AGGTATTGGG GTTACCGTGC TGGCGAAAAA TCGCTAATGA AGTATGCTGA CCGATATCCG
CACGTCTACG ACACTTTGAT CGAGTTTGTC CAAGAAGAAC TTCAAGCAAC GCCTCCCGAC
CACCCCGCCC GCCGAAGCAT AGAGCGCCTC TTGGAACGCC TAAGGAAGGC GAAGGAGGAA
GCGCTCAAAA AGCTGGGGCA CCAAGACGCT AAAGCATGA
 
Protein sequence
MNEVVMRGDG AGEGVEEFDE WFIRRKIEEL LRKALFPPSG YRGLKATHSK VARELAEIKA 
NYVQYGEAYV EHVDRLREGE EKAYREAEKA MQEALEHADE LEIKRTGKTT WKAKLPGRRW
RLYVCRTPTG HWQVEVALLF KVAELRLSDT LRLPPELLRA AQDGWILGDA SYIANKKEVK
MGTAQTWQVA SFPGFWPGKE VVIYVRSVVI HESHVSIMWQ VRVHGVRDVP RWWRLRKEEK
RRMAVAEIEE ANKGNIDERR AVRIATYYAA DGKYPGSNSA LHYLDFAVGR RSRRVRTEQS
VRVARLLYEK VPQLLAFMVA SGCKKAEFLA SLASVKPRHY APRYLEVCGV KMNLRLAGPK
NRRYLMAQVY ITRNNEEMLR DFPERARREG LEVRRVKVSK RYWGYRAGEK SLMKYADRYP
HVYDTLIEFV QEELQATPPD HPARRSIERL LERLRKAKEE ALKKLGHQDA KA