Gene Tpen_0544 details


Gene Information

Locus tag: Tpen_0544
Symbol:
ID: 4600501
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 493876
End bp: 495510
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 55%
IMG OID: 639773315
Product: membrane protein-like
Protein accession: YP_919953
Protein GI: 119719458
COG category: [S] Function unknown
COG ID: [COG3356] Predicted membrane protein
TIGRFAM ID:
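
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so, but the stated gene length agrees with that reading) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 493876
end_bp = 495510

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1635
print(protein_length)  # 544
```

Both results match the Gene Length and Protein Length fields of the record.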


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGGGA CTCATGGAGA CCACTTCGAG GGATTCAAAA GTAGCTATAA GTGGATTATC 
AGGCTACCCG CTACACGTGT TCTTGCACCG ATCTCATTCT CGTTGCTACT ACTTTCCCTT
GGAGCCTCCT TGCTACTAGG AGTGCCGCCA AGCGAGTTAC TTCACTCGCT CTTACTGGGA
GCCGCCGTCC CGCTCTACCT CCACGTATGC GATAGGAGAG TGTTTAACTT CAGGCGTTCT
CTAGGGGTGT TCCTCCTGTA CCTCGCAATG CTCCCACTAG TGCTTGTACT CCGCAAGCCC
CCCATACTGG CCACGGTGCT CGAGGGGCTT GTCGGCTTTC TGCTCGCAGT CGGCATCCTA
AGCGTATGGA AAGCCGGCGC TCTGGCTATA CTCTACTTCC TTTGGTTCTC CGTAGCGCAC
AACGACCCCC GCGCACTGTA CATCCTCTCA GCCTTCTTCC TGGCGTCTTC TACTGTATTC
TTCGTGGTCG ACCGCAGGAT AAGGAAGGCG GCGGGAGTGA GCGGGGTAGG CTTCCTCGAG
GCGTTCCTGA AGTACATCCT TTCGGGGGCT CGGTCAGAGG TCGAGGAATA CCTTGAGAGA
ATATCCGTCG AAAGGACCCT GCAGGTACAT GTCTACTCGT TCACAGCCGA TCGCGAAATC
GGGCGCCTCG TGATTTCCAA CGTGCACCCG GGTCCTCTCA GAGACTTGGG TAGTAGCACT
CTGCCACAAC TCATCGCGAA GTGTAGGGAT ACACCAACGC TGTTTTTGAA GGCTCCGTGT
ACTCACTCGG AAAATCTTCC CAGGCTCAGG TACTCCGAGG AGCTGGCAGG CGCCGTGTGC
AGCTGCACGC CTAGCCCCGC GGTTGACAGG TGCGGTTTAG GCTACTCCTC TAGTGGAAAA
GTAGACGTTG TAAGGCTTGC TTTCGGCGAG ATACCGGACC TGGTGTTCCT GGACCCCCAG
GTGATCATGG AGGACTTGCC GTACAGGGTT TCGGAGGAGT CCCACGCCGT GGAGGGGGCC
GTAGCGGTAG ACCTCCACAA CATGATAGCT CCCGGCTACC TGAAGATAGA CGAAGACGAC
GAAATCGAAA TTGCCGAAAT CGTTTCCGCG ATTCACGAGG CGTCCGAGGA GGTTGAAGCA
GAGGGAGAGA TTTACGCCGG CTTCGCGCGT GTAGAATACA CTGATGGAGC CTCCGTAGGT
CCCGGCGGGG TTGCATGCGC CGTGCTTCAA ATAGCCGGCA AGAAACTCCT CTTGCTGAGC
ATAGACGGAA ACAACATGAC GCCAGACTTC AAGGAACGCG TTTTACGCAC GTTCAAGGAG
TCCTTCGAAA AAGTCTTGGT CGCGACTACG GATACCCACC TATACACGGG GCTATACAGG
AACGTCGACT ACTACCCAGT AGGAGCGCTG AGCCCCGAAA AAGTCCTGGA AACCTGTATG
TCTTGCGTTG AAAAAGCTTT GAAAAACGTC TCGAAGGTTA GGGTGGGTTA CTGCTCAGTG
CCATTCAGAG GCAGGTTCAT GGACGGCGAG ATGCTCCGCC GCATCTCTGT TGCAACGAAG
AAAAACACTA GGGACAGCCT AGCCGTAGTC TTTCTGGCGT TTGCTGTCTC GCTAGCCCTT
CTCCTGCTCC CGTAG
 
Protein sequence
MIGTHGDHFE GFKSSYKWII RLPATRVLAP ISFSLLLLSL GASLLLGVPP SELLHSLLLG 
AAVPLYLHVC DRRVFNFRRS LGVFLLYLAM LPLVLVLRKP PILATVLEGL VGFLLAVGIL
SVWKAGALAI LYFLWFSVAH NDPRALYILS AFFLASSTVF FVVDRRIRKA AGVSGVGFLE
AFLKYILSGA RSEVEEYLER ISVERTLQVH VYSFTADREI GRLVISNVHP GPLRDLGSST
LPQLIAKCRD TPTLFLKAPC THSENLPRLR YSEELAGAVC SCTPSPAVDR CGLGYSSSGK
VDVVRLAFGE IPDLVFLDPQ VIMEDLPYRV SEESHAVEGA VAVDLHNMIA PGYLKIDEDD
EIEIAEIVSA IHEASEEVEA EGEIYAGFAR VEYTDGASVG PGGVACAVLQ IAGKKLLLLS
IDGNNMTPDF KERVLRTFKE SFEKVLVATT DTHLYTGLYR NVDYYPVGAL SPEKVLETCM
SCVEKALKNV SKVRVGYCSV PFRGRFMDGE MLRRISVATK KNTRDSLAVV FLAFAVSLAL
LLLP