Gene Tpen_0418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0418 
Symbol 
ID4600496 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp380439 
End bp381863 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content57% 
IMG OID639773183 
Producthypothetical protein 
Protein accessionYP_919830 
Protein GI119719335 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGTGTG AAGAGAGGCT TGAAGAAGAG GTTAGGGAGG AGACGTGGAG GAGGGCGGCG 
CTAAGCCTCT ACTCGACCGA GGTCGAGGAT AGGAGGAGGT CTAGGGGCGG GAAGAAGGGC
AGGGTACACT ATCGCGGCCT CTACGACGCA GTGTCTAGGA TTAACTGGGA CTTTACGCGC
TTCGCGGCGC ACGCGCTGAG CGTCGTGCCG GACGAGGCAT ACTCCAGGTT CGGCAGGCTG
TTTGACATCG ACGCGAGGAA GTACCTCCTG CTGAACGACG ACGAGAAGCC GAGGGAGGGA
GGCGCAGTTG TGGAGCTACG CGATAGGCTA CAGGCAATCG TCGACGCTAC CGAAGACGGC
CTCAAAGTTG AGAGACACGG AAAGCACTGG CAAGTATACA TACCTGGAGA AAACTGGCAC
GTCATCGCTT ACAAGCCTAC ACATAACTGG ATCATACTCA TCCCGTTGAA AGGCTTCTGG
GTTGAGTCAG AGTTTCCCAG GGTTCTAGTG AATACTCCGA TCAGCGTTTT GCAGAGCCTG
CAGAAAGGGT GGGTCTTAAC GGATGTGACA CCCCCTCATG GGCGCTACAG CTACGTACAA
TTCAGCACCA CTCAACCGTG GCAGTTGCCA GCCACGCTCG CGGCCTTTCC TAGTGACAAT
ATCTGCTTGG GCGTTACGGC GGGCGCACTC GGCAGTACCA AGCTAAGCAT CGAGTGGAAG
GTGCACGTCT ATAGCTACGA GGAGGAGCTG GGCTGGGCCT CCGAACTCAT CGGCGAGGTT
AAACGTGCAG AGTTCCGCCA GCTTATCGAT GCTTGCAGGG TGCTACAGGG AGACCCGGTC
GCTCTACACA CAGCCTTCCT GGGGGACGGC TACCTCGCCT TCTTCCTGAG GCTTCGGATG
CTCTTCTTCA GTATTGGACA CGAGATTTTC TACCTCCCAG CTGAGAGCGC TATAGTCAAC
GCTAGACTCG CCGTCGAGCT AGCACCAGAG TACACAAAGT TCGTCTCATT AGTGACGAAG
TGTCCAAAGA TTAAACACTT CTTGTTCGTC GGCTTCGGAT TGCCGCAAAA GAGAGGTATG
AGAGACGGTC AGAGAAAGAC CCCGTTCTAC GCCGAGATAG CGGGGGCTAG GCTACACCTA
GTCTACATTT CTACGAGGAA TCATGTCTAC GCGAGGATCG CGGTCGACGA TGCGCCTCAA
GGCTGGGTGG AGGAGGCGCG CGCTCAGGGC TGGGACGTCC GGGTGGTTAA CATGGGAAGC
GAGGAGTACT ACCAGGTTAC ACATGTCTCT TTGATGGAGC ACGCGCGCTA CGACGAAGCA
CTGCGGGAAA CACTCCTCGC CTTCGCGAAG GCGAAAGCCG AGCAGTACCC CAAAGCCTGG
GAACTCGTAG AGCGCCTCGA AAAGCTGGGG ACAGGGCAGG AATAG
 
Protein sequence
MGCEERLEEE VREETWRRAA LSLYSTEVED RRRSRGGKKG RVHYRGLYDA VSRINWDFTR 
FAAHALSVVP DEAYSRFGRL FDIDARKYLL LNDDEKPREG GAVVELRDRL QAIVDATEDG
LKVERHGKHW QVYIPGENWH VIAYKPTHNW IILIPLKGFW VESEFPRVLV NTPISVLQSL
QKGWVLTDVT PPHGRYSYVQ FSTTQPWQLP ATLAAFPSDN ICLGVTAGAL GSTKLSIEWK
VHVYSYEEEL GWASELIGEV KRAEFRQLID ACRVLQGDPV ALHTAFLGDG YLAFFLRLRM
LFFSIGHEIF YLPAESAIVN ARLAVELAPE YTKFVSLVTK CPKIKHFLFV GFGLPQKRGM
RDGQRKTPFY AEIAGARLHL VYISTRNHVY ARIAVDDAPQ GWVEEARAQG WDVRVVNMGS
EEYYQVTHVS LMEHARYDEA LRETLLAFAK AKAEQYPKAW ELVERLEKLG TGQE