Gene Tpen_0132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0132 
Symbol 
ID4600717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp107087 
End bp108820 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content54% 
IMG OID639772886 
Productmembrane protein-like 
Protein accessionYP_919545 
Protein GI119719050 
COG category[S] Function unknown 
COG ID[COG1470] Predicted membrane protein 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAGGA GAAAGTTAGC CCTGGTCGCG CTCGTATCCC TAGCCTTCGT GCTAGCCGCA 
TCTATTCCTT CGTATCCGCA GTCGAATGTA TTCTTCTTTG GGAAGGTGGT CGACGAGAGC
GGGGCGCCGG TAGGGCTTGC GGAGATAACG GTGTACAAGG GGAACTCCAT AGTTACATTC
ACCAGAACCT CGCCTGACGG TAGCTTCAAC CTCTACGTAC CAGTGGGAAG CTACACGTTT
CTGGTATACA AGAAGGGCTA CGCGCCCATC TACATGACCC TCGAGGTAAC CCCTGAGAAA
GGGGGGAGCC TCGGCGTACT GGTTCTGAAG AAAGGAGTAA CCGTAGTACC CGATGTAACG
TCTGTTTATA CTTCTCAGGG AGATACCGTC AGAATACCGG TAAAAGTGTT CAACAAGTGT
CTCGACCCTG TTCTCGTAAG CTTCTCCATC GAGGTCCCCC AGGGCTGGAG AGCCTACTTC
GTGGGACCAA ACAACCTAGT AGCCTCCGAT TTCTACATAG AAGCAGGTAG CAACAGGAGC
CTGGTGTTCG TGGTAGAAGT GCCTCTAAAC GCCAGCGAAA AGGAGAGCGT GAGAGTTTCC
TTTACTTGGT TTAACCTCTC GGGACACGTT GACTTCACGT TTACGGTTAG GCAACGCGAG
TGGAGGCTCT TAGAGCTACC TACCACATCG GTGAAAAGCT TTCCTGGAGG GCAACTGTCG
ATACCCATAA ACGTGTCTAC GCCGTTCAGC TACGAGGCTA CCGTAACCCT CTCGGTGTTT
GCGCCGAGCA ACTTCATAGC CTCCTTGGTC GACGAGAACG GCTTGACTGT GCAGTCTGTA
ACGGTGAGGC CGGGGGAGAA GCGCAGGCTT CAACTAGTGA TATACGTGCC GCCCACGGCG
AGGATTTCTA CGTACAGCTT AAGGGTAGTG GCGAGGTCCG GACCTCTTCA AAGCATCGTG
AACCTCGACG TCATAGTAGA AAGCGCCTAC GACCTCCTCA AGATAAGCCC AGGCGCGACG
AGCATTAACG TGACCTCGGG TTCAACCGTT ACCTTCAGGG TAGCCTTGAA GAACGAAGGC
AACATGCCTA CAGTCGCGTT GCTAAGGGTT CAGACGAGCT CTCCACTGCT CAGGGCGTAC
ACCTCCGTAT CGGGCGAGCC CGTGGCCTCA CTCTACCTGG TGCCCGGAGA AGAGAAAGCC
TTAGCCTTGA TAGTAGAAGT TGACCCCTCC ACGCCCTCCG GGATATACCT CGTATCCCTA
CGCGCGAACG GCACTACGAG TACTGCGGAG CAAAGCTTCC TGGTAAGAGT GACCGGCACA
AGGAAGATCG TAATTTCCAA CATAATCTTC CAGGTTACCG GCGCTCCAGG AATGACGACT
ACGTACAAGC TAGGGCTGGT GAACGCTGGG AACACCCCAT TGAATAACGT TATGGTTCAA
GTGGAGGCTC CGAGCGGCGG ATTCGAAGTA ACGGTGCATC CCTCCACACT GGCTTTACCG
CCTAACTCCA CCGCAAGCGT AGACATATCG ATACTTATCC CCTCGAACGC TAGCGAGGGG
TTCTACAACC TACCCATACA CGTGGTGGCA GGCGACATCA GGCTGGACAG AGTTCTCGTG
CTCGAAGTTA GGGGGGAACA AGGACTAGGC TTCACGTATA TTGCAACCGG GCTCTTCCTC
TTGAGTTTAA CGGTGGTTTC CTACTCTAAG AGGCAGAGGA GAGCCAGGGG GTAG
 
Protein sequence
MSRRKLALVA LVSLAFVLAA SIPSYPQSNV FFFGKVVDES GAPVGLAEIT VYKGNSIVTF 
TRTSPDGSFN LYVPVGSYTF LVYKKGYAPI YMTLEVTPEK GGSLGVLVLK KGVTVVPDVT
SVYTSQGDTV RIPVKVFNKC LDPVLVSFSI EVPQGWRAYF VGPNNLVASD FYIEAGSNRS
LVFVVEVPLN ASEKESVRVS FTWFNLSGHV DFTFTVRQRE WRLLELPTTS VKSFPGGQLS
IPINVSTPFS YEATVTLSVF APSNFIASLV DENGLTVQSV TVRPGEKRRL QLVIYVPPTA
RISTYSLRVV ARSGPLQSIV NLDVIVESAY DLLKISPGAT SINVTSGSTV TFRVALKNEG
NMPTVALLRV QTSSPLLRAY TSVSGEPVAS LYLVPGEEKA LALIVEVDPS TPSGIYLVSL
RANGTTSTAE QSFLVRVTGT RKIVISNIIF QVTGAPGMTT TYKLGLVNAG NTPLNNVMVQ
VEAPSGGFEV TVHPSTLALP PNSTASVDIS ILIPSNASEG FYNLPIHVVA GDIRLDRVLV
LEVRGEQGLG FTYIATGLFL LSLTVVSYSK RQRRARG