Gene Tpen_1402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1402 
Symbol 
ID4601816 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1354892 
End bp1356460 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content67% 
IMG OID639774177 
Producthypothetical protein 
Protein accessionYP_920802 
Protein GI119720307 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1361] S-layer domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTCCAA TAAAGGCGGA ACGTATGAAG ACTGGACCGG TCTTGCTCTT GATAGCGGTA 
ACCGTAGCGG CAACACTCGT GGGCGCGGGC CCGGTGACCG TCAGCCCCTC GGGGAGCGCC
AGGGCCTACG CGTTCGTACT GGAGGATGTA AGCGTTCTGG ACAGCGGCGT CCCCTCAGGG
GTGGTGCAGG TCTCGGTGAG CTACTACGGG GCGTACACGT TGCTCGGGGC AAGCCTTGGG
CTAACGGCTC GCTGCGGCGG CGCGGAGGTC TCCGCGGGTA GCGTGGACGT CGGCTCCTGG
CGCCCCGGGA CCGTCAAGAC GGCGAGGTTC ACGCTTAACA CGTCGAGCCT CGGGTCGGAG
TGCACCCTGA GGGTCTCGGT GTCCTGGGGC GACTCGTGGG ACGACGCCCA GAAGACTTAC
ACGGGGCTGG GCGGGTCGAC GAGCCTCGAG TACAGCTTTA CGGCGTGCTG GGGCGAGAGG
GTCTCGGTGG GCGTGAGGCC GCAGATGGTG TACTCTTCAA CCGTTAACCC CGTGCTCCTA
GTCGTGGAGA ACTCCGGGCG GACCGCGCTC AGGCAGGTAG AGGTGTACGT CGCGCCTCAA
GGCTCAGTCC TGCTCAACGC CTCCGTGCCT ACCGTTTTCG AGCTGGGAGA CCTGAAGCCC
GGGGAGAGGA GGGTAGTGCC GCTCAGCGTG GTCCCGCAGT CCCCCTTCCC CTCGTTCTCC
GTCACCGTGA GCTACCTCGA CTGCTCCGGG TCCAAGAAGA GCGTGGCGCA ACAGGTATAC
CTCTACGCGG CGGCGGGGCA GAGCATAGTC GTCGTGCCGG ACCCGCCGGT TCTCGTAGCC
GGGCAGGCGT CCAACGTCTC GCTCAGGGTG GTCAACGCCG GCGGGGTCGC CGTTAAGGGG
CTTAGCCTCG TGCTGGGGGT CCAGAAGAGC CCCCTGAGCG TCTCCCCGAG CTTCCTAGTA
GTCGGCGACC TGGGGCCGGG CGAGTCGAGG AGCGTCCCTG TAACCGTGCT CGTACCGGCG
ACGGCTTCTA GCAGCGAGTC CGTCGCCTAC CAGGCTCTCT ACAGCGTGGA GGGAGGCGGG
CTGGCGACTA GCGGAGGGTC GTTCACGTTC TACGTCGCCC AGAGGTCCTC CGTGTCCATA
ACCTCGGTGG ACGTCGTGCC GCAGAGCCCC GAGGTCGGGT CCAACGTCAT ATTCGCGGTG
AGCCTGGTGG ATGACGGCAC GTTCCCCGTC TACGCGGTCA ACGTCTCTGC CTACGCGTCC
AGGGGCCTCT CCCCCCTGCG CTCGACCTAC GCGTACCTGG GCCAGCTCAA CCCCCAGGTC
CTCACGACGG TCCCGTTCAG CTTCAGGGCA GTCGAGGAGG GGATGCAGGA GGTCAGGTTC
GTCGTGACGT ACAGGGACGC GTACGGGTAT TCGAGGAGCG CCGAGAGGAC GGTCTACGTC
AACGTGGCGA GGCAGCAGCC CTCGCGCCAG GCGCAGGGCG GGTCCGCGAA CCCGTACGTC
TACCTCGCCG CCGTTGCGGT AGCCCTGCTG CTGGCCGCCG CGTACGCCGC GAGGAAGAGG
AGGGGGTAG
 
Protein sequence
MPPIKAERMK TGPVLLLIAV TVAATLVGAG PVTVSPSGSA RAYAFVLEDV SVLDSGVPSG 
VVQVSVSYYG AYTLLGASLG LTARCGGAEV SAGSVDVGSW RPGTVKTARF TLNTSSLGSE
CTLRVSVSWG DSWDDAQKTY TGLGGSTSLE YSFTACWGER VSVGVRPQMV YSSTVNPVLL
VVENSGRTAL RQVEVYVAPQ GSVLLNASVP TVFELGDLKP GERRVVPLSV VPQSPFPSFS
VTVSYLDCSG SKKSVAQQVY LYAAAGQSIV VVPDPPVLVA GQASNVSLRV VNAGGVAVKG
LSLVLGVQKS PLSVSPSFLV VGDLGPGESR SVPVTVLVPA TASSSESVAY QALYSVEGGG
LATSGGSFTF YVAQRSSVSI TSVDVVPQSP EVGSNVIFAV SLVDDGTFPV YAVNVSAYAS
RGLSPLRSTY AYLGQLNPQV LTTVPFSFRA VEEGMQEVRF VVTYRDAYGY SRSAERTVYV
NVARQQPSRQ AQGGSANPYV YLAAVAVALL LAAAYAARKR RG