Gene Tpen_0423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0423 
Symbol 
ID4602102 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp384343 
End bp385791 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content57% 
IMG OID639773188 
Producthypothetical protein 
Protein accessionYP_919835 
Protein GI119719340 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAGAG GCATGACGGG GGGTGAAGAG GGCGAGGAGG TTGTTAGGGA GCTTGTCTAC 
GAGAGGGCGG CTCAAAGCTT GTATGCCACC AGAACAGAGG ATAAGAGGAG GTCCAGAGGT
AAGAAGAAGA GGGGCGAGAT ACACTACGTG GGCCTCTTCG ACGCGGTGAC CGGGATTAAC
TGGGACTTTA CCAGGTTTGC GGCGCACGCG CTGACCGTCG TGCCGGACGA AGTCTACCCG
AGGTTTTACC GCTTCATAGA CATCGACGCG AGGAAGTACC TCCTACTGGG AGACGACGAG
AAGCCACGCG AAGGAAGCGC GGCGATGGAG CTACGCAATA GGCTACAAGC AATTGTCGAT
GCAACAGAAG ACGGGCTTAG AGCTGAGAAG AAGGGTAAAG TCTGGCGCGT GTACGTGCCG
CACGAGAACT GGTACGTAAC TGTCTCGAGG CCCAGTACTC ATAGCTGGTC CATACACGTC
CCATTGGAGG GCTTCTGGAC CGAGGCTAGT TTCCCGGAGG TTCTCGCGAG GACATCGCCA
GACGTGCTTA GAAGCCTGCA GAGAGGGTGG CTTCTCACAG ATGTCACTCC GCCACACGGA
CGCAACAGCG ACGTAAATTT CAGTACCACT CAGCCATGGC AGTTGCCGGC AACGCTCGCC
AACTTCCCGG GAGAAGTCAG GCTCGGCGTC ACAGCAGGAG TCCTCGGCTC CACGAGGGCG
AGCATTAAGT GGAAGGCCTA CGTTCACGGT TACGCGGAGG AGCTGGGTTG GGCTTCCGAA
CTCATCGGCG AGGCGAAGCG CGCTGAGTAC CGCAGGCTGG TCGACGAGTG CAGGGCGCTA
CGGGGAGACA GCGTAGCTCT TTTGACCGCC TTTTTGGGAG ATGGTATGCT TGCTTTCTTT
CTAAGGCTTC GGATGCTCTT CTTCAGGATA GGCAACGAGA CTCTCTACCT CCCAGCTAAG
AGCGCCATTG TCAACGTTCG CTTGGCTGTG GAAAGGGCTA GCGAGTACGT ACGCTTTGTC
TCGCTGGTCA CGAAGAACCC GAAGATCCGG CACTTCCTGT TCGTCGGCTT CGGATTGCCG
CAGAAGAGGG GTAAGAAGGG CGGGCAGAGA AACAGCCCAT TCTACGCAAA CATTGCAGGG
GCTAGGCTAC TTCTGGCCTA CGTATCCAGT ACTAACAACA TCTACGCTAG GATCGTGGTT
GATGCTGTGC CTCAAGGCTG GTACGAGCAC GCGCTGGAGG AAGGCTGGGA CGTGAGGATA
GTTGCTTCGG GTACCTCTTC GGGTGGCAAG GAATACTACC AAGTGACGCA AAGCTCTCTC
TTCGAGCACG CCCGCTACGA CGCGGCTCTG CGGGAAACAC TCCTCGCCTT CGCGAAAGCG
AAAGCCGAGC AGTACCCCAA AGCCTGGGAA CTCGTAGAGC GCCTCGAAAA GCTGGGGACA
GAAGACTAA
 
Protein sequence
MGRGMTGGEE GEEVVRELVY ERAAQSLYAT RTEDKRRSRG KKKRGEIHYV GLFDAVTGIN 
WDFTRFAAHA LTVVPDEVYP RFYRFIDIDA RKYLLLGDDE KPREGSAAME LRNRLQAIVD
ATEDGLRAEK KGKVWRVYVP HENWYVTVSR PSTHSWSIHV PLEGFWTEAS FPEVLARTSP
DVLRSLQRGW LLTDVTPPHG RNSDVNFSTT QPWQLPATLA NFPGEVRLGV TAGVLGSTRA
SIKWKAYVHG YAEELGWASE LIGEAKRAEY RRLVDECRAL RGDSVALLTA FLGDGMLAFF
LRLRMLFFRI GNETLYLPAK SAIVNVRLAV ERASEYVRFV SLVTKNPKIR HFLFVGFGLP
QKRGKKGGQR NSPFYANIAG ARLLLAYVSS TNNIYARIVV DAVPQGWYEH ALEEGWDVRI
VASGTSSGGK EYYQVTQSSL FEHARYDAAL RETLLAFAKA KAEQYPKAWE LVERLEKLGT
ED