Gene Tpen_0468 details


Gene Information

Locus tag: Tpen_0468
Symbol:
ID: 4600996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 427168
End bp: 428466
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 57%
IMG OID: 639773236
Product: hypothetical protein
Protein accession: YP_919880
Protein GI: 119719385
COG category:
COG ID:
TIGRFAM ID:
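
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 427168
END_BP = 428466
GENE_LENGTH_BP = 1299
PROTEIN_LENGTH_AA = 432


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))   # 1299
    print(coding_codons(GENE_LENGTH_BP))   # 432
```

Under those two assumptions, both computed values agree with the lengths reported in the record.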


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGGGGT ATGAGGAAAT CGAGAAGACT GTTAGGGAGG AGACGTGGAG GAGGGCGGCT 
TTAAGCCTCT ACGCCACCAG AACAGAGGAT AAGAGGAGGT CCAGGGGTAA GAAGAAGAGG
GGCGAGATAC ACTACCGCGG CCTCTACGAT ACGGTGTCTG GGATTAACTG GGACTTCACC
AAGTTCCTCG TAAACGGCTA CAGCGTCGTG CCGGACAGCG TTTACCCGAG GTTTTACAGG
TTCATCGACT ATGACTTGAG GAAGTATCTC TTGCTGAACG ACGACGAGAA GCCACGCGAA
GGAGGCGCGG TAATGGAGCT CAAAGGAAGG TTGCAAGCAA TCGTCGACGC CGGCGCCGAC
GGTCTTAGAG CTGAGAAGAA GGGTAAAGTC TGGCATGTAT ACATACCTAG AGAGAACTGG
CACGTGAGGG TCTCGAAGCC TACGCATGGC TGGTCCGTAC ACATCCCATT GGAGGGCTTC
TGGGTCGAAT CGGAGTTTCC CCAGGTTCTA GTGAATACTC CGAGCGATGT TCTCAGGAGC
CTGCAGAAGG GGTGGATCCT TACGGATGTG ACACCCCCTC ACGGGCGCTA CAGCGACGTA
CGTTTCGGTA CCACTCAACC GTGGCAGTTG CCAGCGACGC TCGCAACCTT TCCAAGCGAC
GATGCCAGGC TCGGCGTTAC GGCGGGCATA CTCGGTAGCA CCAGGCTGAG CTTTCAGTGG
CAGGTGCGTG TCTACGGTTA CGAGGAGGAG CTGGGCTGGG CCTCCACGCT CATTGGCGCT
ACGAAGCGCG TGGAGTTCCG CAGGTTGGTC GAGGAGTGCA AGGAGCTCAA CGGCGACCCA
GCGTCTCTTT TCACTACCTT CCTGGGAGAC GGCTACCTCG CATTCTTTCT AAGGCTTCGG
ATGCTCCACT TCAGGATAGG CAGCGAGGTT TTCTACCTCC CAGCTGAGAG CGCCATAATC
AACGCTAGGC TTGCCGTGGA GAGGGCTAGC GAGTACACCA AGTTCGTCTC ATTGGTGACG
AAATGCGCTA AGATCAAACA CTTCCTATTC GTCGGCTTCG GATTACCTCG GAAGAAGGGT
AGGAAAAACG GGCAGAGAAA CAACCCGTTC TACGCCGAGA TAGCAGGGGC ACAGCTACAC
CTAGCCTATG TATCCAGCAC CAACAACATT TACGCGAGGA TCGCAGTCGA AGCTGTGCCT
TCGGGCTGGG TGGAGGAGGC ACGCGCTCAA GGCTGGGACG TCCGGGTGGT TCGAATGGGT
GGGGTAGGGA GTACTACCAG GTTACACACG CCTCGCTAA
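
The gene sequence above is printed in blocks of 10 bases, so it has to be joined before any computation. The sketch below shows one way to remove the spacing and recompute a GC percentage for comparison with the reported GC content; `clean_sequence` and `gc_percent` are hypothetical helper names, and the exact rounding used by the database is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence in this record, used as sample input.
    first_line = (
        "ATGATGGGGT ATGAGGAAAT CGAGAAGACT "
        "GTTAGGGAGG AGACGTGGAG GAGGGCGGCT"
    )
    seq = clean_sequence(first_line)
    print(len(seq))
    print(round(gc_percent(seq), 1))
```

Passing the whole gene sequence text through `clean_sequence` should give a string whose length matches the Gene Length field.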
 
Protein sequence
MMGYEEIEKT VREETWRRAA LSLYATRTED KRRSRGKKKR GEIHYRGLYD TVSGINWDFT 
KFLVNGYSVV PDSVYPRFYR FIDYDLRKYL LLNDDEKPRE GGAVMELKGR LQAIVDAGAD
GLRAEKKGKV WHVYIPRENW HVRVSKPTHG WSVHIPLEGF WVESEFPQVL VNTPSDVLRS
LQKGWILTDV TPPHGRYSDV RFGTTQPWQL PATLATFPSD DARLGVTAGI LGSTRLSFQW
QVRVYGYEEE LGWASTLIGA TKRVEFRRLV EECKELNGDP ASLFTTFLGD GYLAFFLRLR
MLHFRIGSEV FYLPAESAII NARLAVERAS EYTKFVSLVT KCAKIKHFLF VGFGLPRKKG
RKNGQRNNPF YAEIAGAQLH LAYVSSTNNI YARIAVEAVP SGWVEEARAQ GWDVRVVRMG
GVGSTTRLHT PR
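
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: it uses the codon assignments of the standard genetic code, which translation table 11 shares (the two tables differ in which codons may act as start codons), and it reads in frame from the first base up to the first stop codon. The helper names are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments
# are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First and last lines of the gene sequence in this record.
    head = clean_sequence("ATGATGGGGT ATGAGGAAAT CGAGAAGACT")
    tail = clean_sequence("GGGGTAGGGA GTACTACCAG GTTACACACG CCTCGCTAA")
    print(translate(head))  # MMGYEEIEKT
    print(translate(tail))  # GVGSTTRLHTPR
```

The two printed fragments match the start and end of the protein sequence listed above. The last line can be translated on its own because the 1260 bases before it are a whole number of codons.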