Gene Tpen_0698 details


Gene Information

Locus tag: Tpen_0698
Symbol:
ID: 4602013
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 648295
End bp: 649722
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 57%
IMG OID: 639773472
Product: hypothetical protein
Protein accession: YP_920103
Protein GI: 119719608
COG category:
COG ID:
TIGRFAM ID:
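
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes 1-based inclusive coordinates and a CDS that includes its terminal stop codon (the gene sequence in this record ends in TAA). The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values copied from the Gene Information fields of this record.
START_BP = 648295
END_BP = 649722

length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1428
print(protein_length_aa(length))  # 475
```

Both results match the Gene Length and Protein Length fields.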


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.482742
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTAGGG AAATGATGGG GTGTGAGAAA AACGTGGCGG CTGAGGAGGC AGGGTTCGAA 
GAGTTCGACG AGTGGTTCAT TCGCAGGAAG ATCGAGGAGC TCCTGAGGAA GGCGCTGTTT
CCGAGCGACG GCTATAGGGG GCTCGAGGCT ACTCGCAACG AGGTTGCGCG GGAGCTTGCG
GAAATAAGGG GCAACTATGC TCGGTACGGC GAGGCCTACG TAGAATACGT TGGCAGGCTC
GGGGAAGGCG AGGAGAAGGC GATCAAAGAG GCTGAGAAAG CGATGAGGGA AGCCCTGGAG
CACGTAGACG AGCTCGAAAT CAGGAGGACA GGCAGAGCGC ACTGGAAGGC TAAGCTTCCA
GGCAGGAGTT GGAGGCTATA CGTACACTTG AAGCCTAGCG GGTACTGGGA GGTCGAGATC
CGTTTACATC TCAGAGTCGT CAAGCTCAGG CTACCCGATA CCCTGAGGCT CCCACCTGAG
CTACTTAGAG CCGCCCAAGA GGGCTGGATC ATGGGAGACG CATCGTACCG CGCGGACCGC
GAAGAAGTCA CGATGACTAC AGCGCAGACA TGGCAAGTAG CCTCCTTCCC TGGCTTCTGG
CCGGGGAAGG AGGTGGCTGT CTACGTTGAT AGAATCGAGA TCCACGAGAC GAGAGTCAGC
GTTAAGTGGT ACGTGGTTGT TAAGGGTGTT CGCGACGCGC CTCGCTGGTG GAGTCTCTCC
AAAAAGGAGA AGCAGGGTAT TATCTTGGCG GAGATCGAGG CGGCGAACAG AGGAGAAATC
GACATTTTCA GGGCGCTAAA GCTCGCATTG CTATACGTTA CGGATGGAAT GTATCCAGGG
TCAAGCAACA CAGCTAGACA TGTGCTGGAT TTCGCGGTCG GTCAAAATTC TCGCCGAGTT
AGAACCGAGG GTGCGGTGAA GGTTGCGAGG CTTCTCTACG AGAAAGTACC GCAACTCTTA
GCGTACATGG TTGCGTCAGG CTGCCAGAAA GCAGAGTTCT TAGCGAGCCT GGCATCCGTG
AAGCCGCGAC ACTACGCACC TCGCTACCTA GAGGTTGCGG GTGTCAAAAT GACCTTGCTA
CTCGTAGGCG CTTCACGCGC GCTTGCGGCA GTGGTGTACG TAACCGAGGA TAACGAGGAG
ACGCTTAGGG GTTTCCCTGA GAGGGCGAGG CGGGAGGGCT TAGAAGTCAG GAAGGTGAAG
GTGAGCAAGG GGCGCTGGGG TTATCGCGCC GGGCAAAAGG AGTTGCTAAG GTATGCCGAT
AAGCGCCCAA TAGTTTACGA CACACTCATA GCGTTCGTGG AGGAGAGACT CGGAGCAATG
CCTCTCAATC ACCCAGCAAG GCCGAGTGTG GAGCGCCTCC TGGAACGCCT AAAAAAGGCG
CGCGAAAGAG CGCTTAGAAA GCTGGGGGAC CAAGACGCTA AAGAGTAA
 
Protein sequence
MGREMMGCEK NVAAEEAGFE EFDEWFIRRK IEELLRKALF PSDGYRGLEA TRNEVARELA 
EIRGNYARYG EAYVEYVGRL GEGEEKAIKE AEKAMREALE HVDELEIRRT GRAHWKAKLP
GRSWRLYVHL KPSGYWEVEI RLHLRVVKLR LPDTLRLPPE LLRAAQEGWI MGDASYRADR
EEVTMTTAQT WQVASFPGFW PGKEVAVYVD RIEIHETRVS VKWYVVVKGV RDAPRWWSLS
KKEKQGIILA EIEAANRGEI DIFRALKLAL LYVTDGMYPG SSNTARHVLD FAVGQNSRRV
RTEGAVKVAR LLYEKVPQLL AYMVASGCQK AEFLASLASV KPRHYAPRYL EVAGVKMTLL
LVGASRALAA VVYVTEDNEE TLRGFPERAR REGLEVRKVK VSKGRWGYRA GQKELLRYAD
KRPIVYDTLI AFVEERLGAM PLNHPARPSV ERLLERLKKA RERALRKLGD QDAKE
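
The sequences above are printed in space-separated blocks of 10 characters, so the spacing has to be removed before any length or composition check. The sketch below shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first line of the gene sequence, so its output is not the record's overall GC content.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in ("G", "C") for base in seq) / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGGGTAGGG AAATGATGGG GTGTGAGAAA AACGTGGCGG CTGAGGAGGC AGGGTTCGAA"

seq = clean_sequence(first_line)
print(len(seq))  # 60 (six blocks of 10 bases)
print(f"{gc_fraction(seq):.2f}")
```

Applying `clean_sequence` to the full gene sequence should give a string whose length can be compared with the Gene Length field, and `gc_fraction` can be compared with the GC content field.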