Gene Tpen_0233 details

Gene Information

Locus tag: Tpen_0233
Symbol:
ID: 4600703
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 209029
End bp: 210459
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 54%
IMG OID: 639772987
Product: preprotein translocase subunit SecY
Protein accession: YP_919646
Protein GI: 119719151
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0201] Preprotein translocase subunit SecY
TIGRFAM ID: [TIGR00967] preprotein translocase, SecY subunit
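
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and an unspliced CDS ending in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(209029, 210459)
print(length)                  # 1431
print(protein_length(length))  # 476
```

Both values agree with the Gene Length and Protein Length fields listed above.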


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.745222
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGGGCTTA AAGAATCTTT AGACAGCGTT TTTAGATTTC TCCCGGAGAT AGAAAAGCCA 
CGGAGAAAGC CGCCTCTCAG CGAAAGGTTG CTCTGGACGG CCCTGGTGCT AGTCGCCTAC
TTCGTAATGG GTCAGACACC GCTGTACGGT ATTCCGAGGC AGACCCAAGG CACTCTAGGC
GCCCTCGAGT TTCTAAGAGT CGTCATGGCG TCCAAGAGGG GGACTCTCAT TGAGCTTGGT
ATAGGTCCAA TAGTGACTTC CGGAATAGTC TGGGAGCTAC TGGTCGGTAG CAGGATAGTA
AACCTTGACC TCACGACACC GGAAGGCAGG AGGACTTTCG CAGGTCTACA GAAACTCACG
GCTTTCCTCT TCGCAGCGTT AGAGGCGGCA GCCTACATAC TCGGGGGCGT TTACGGAGCC
CTAACACAGC AACAACAGAT CATAGTCTTC GTGCAGCTAT TCGTTGCGAG CACGTTCGTT
ATACTCATGA ACGACATGCT CGAAAAGGGC TGGGGCATAG GGAGCGCTGT CTCGCTATTC
ATAGCGGCAG GTGTCGCTCA ACAGATCTTC TGGGAACTCT TCAGCCCGAT AGGACCCCTA
GGGGACGGGC TCTACTACGG GCTCTTCCCG TCGCTCTTCT CCGCGCTGGT CAGCGGTAAC
TCAACGCTAC TGATGCATGT CGTAGTTCGA CCGAGCGGGT ACCCCGACCT TGTAGGCTTC
GTGGGAATGG TTGTTATGTT ACTGCTACTA ACGTACATGG AGTCGATGAA GATCACGATA
CCAGTTTCTA GCGTTAGGTT TGGCGGGGCG AAAACGAGGA TACCGTTGAA GTTCCTCTAC
GTATCGGTCA TGCCGGTAAT CCTCGTAGGC GCTCTCTATG CCAACGTGGT GATGTTCACG
CAGGCGCTGT GGCCCAGGGT GAATCCGGGC AACCAGAACC CCTGGCTCAA CGTTATCGCA
AAGTACAACT ACACGGAGTA CGGCCCGGTG CCTCTACCTG GGTCGTTCGT GTACTACATA
TCTCCTCCGC GCTCACTTGC GTCCGCCCTC GCCGATCCCG TCCACCTAGT GGTGTACTCT
CTGCTCTACA TCGGGTTCGC CGTCCTCTTC GGAGTAGCCT GGATCCTAAC AAGCGGCATG
GATCCCGAAA CGCAGGCGGA GCAGCTCGCA AAGGCTCAGC TACAGATACC CGGCTTTAGG
AAAAGCGAGA AAGTCATAGC ATCCATGTTG AAGCGCTACA TCTGGGGGTT AACGATACTG
AGTAGCATAA TAATAGGCGT CATCGCTGTA GTCAGCGATA TATTCAGAGT AATGGGTGGC
GGCACGGGCA TACTACTGTT GGTAGGCATA ATAGTGCAGT ACTACTCCAT ACTGGCGAGC
GAGAGGGCAC TCGAAATGTA CCCATCGCTC GCGAGGCTCA TTGGAGAGTA A
 
Protein sequence
MGLKESLDSV FRFLPEIEKP RRKPPLSERL LWTALVLVAY FVMGQTPLYG IPRQTQGTLG 
ALEFLRVVMA SKRGTLIELG IGPIVTSGIV WELLVGSRIV NLDLTTPEGR RTFAGLQKLT
AFLFAALEAA AYILGGVYGA LTQQQQIIVF VQLFVASTFV ILMNDMLEKG WGIGSAVSLF
IAAGVAQQIF WELFSPIGPL GDGLYYGLFP SLFSALVSGN STLLMHVVVR PSGYPDLVGF
VGMVVMLLLL TYMESMKITI PVSSVRFGGA KTRIPLKFLY VSVMPVILVG ALYANVVMFT
QALWPRVNPG NQNPWLNVIA KYNYTEYGPV PLPGSFVYYI SPPRSLASAL ADPVHLVVYS
LLYIGFAVLF GVAWILTSGM DPETQAEQLA KAQLQIPGFR KSEKVIASML KRYIWGLTIL
SSIIIGVIAV VSDIFRVMGG GTGILLLVGI IVQYYSILAS ERALEMYPSL ARLIGE
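
The sequences above are laid out in space-separated blocks of ten characters. A minimal sketch of how such a block could be flattened and its GC fraction computed is given below; the helper names are hypothetical, and the example input is only the first line of the gene sequence, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence block, used here only as sample input.
first_line = "TTGGGGCTTA AAGAATCTTT AGACAGCGTT TTTAGATTTC TCCCGGAGAT AGAAAAGCCA"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this 60-base fragment only
```

Applying the same two functions to the full gene sequence would give a value comparable with the GC content field in the record.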