Gene Tpen_1032 details


Gene Information

Locus tag: Tpen_1032
Symbol:
ID: 4600511
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 973412
End bp: 974626
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 64%
IMG OID: 639773810
Product: hypothetical protein
Protein accession: YP_920435
Protein GI: 119719940
COG category:
COG ID:
TIGRFAM ID:
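
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 973412
end_bp = 974626

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1215 bp) and Protein Length (404 aa).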


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.435102
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGGCGAAG CTTCGAAAGT CCTCGTCGCC GCTACCACGC GGGACGGCCG CCTGGTCTAC 
TTGTCTGCAG TTGCTAAGCC TCAACAGCCG GGGCTAGACG AGGCTCTCGC GAGCCTCCTG
AAAAGCTTGG CGTACCACAG CTACGAGGAG CTCCAGGGAG ACAGGGTCCT GCTGGAGGCG
AAGAAGGCTT TATCGTCGCA GGGCTTCAAG GTGGAGGACC TCGAGCTCTC TGTTTCCTTC
AGATGCCCCT CCTGCGGAGC CTCCATAAAC TTCTCGCCTG AAACCGTCGT CTACGTGTGC
CCCTACTGCG GGTGGAGCGG CGACGTGTAC GGGAGGGAGC TACGCGTGAA GGCTTGGCCC
CCGGGAGGCA GGGAGAAGCT GGAGGAGATA GTCAGGGGTC TGGGGGGAGT CCTACACGAC
GCGGTTCTGC GCTACGTGCC CTTCTGGGTC TTCAAGGTGA AGGTAGAGGG CTCGTACGCG
GGGACAGCGA CGTACACTGT TACCAGGACG GAGTACGTCA CGGTGATCCA TGAGGGCAGG
CCCCGGCAGA TCCCGACGAC CAGGACGGAG GTTAGGAGGA AGAAGGTTGC GGGCCGCGTG
AGCTTCTCTA CGGTCAAGGG CGTGGGGGCG CGGGTACTCG CCGAGGTGTT CGGAGGGGAA
GGCCTGAAGA GGTGGGTGGA GTACGAGTGG GAGAACAACC CTCCCCCGGA GCTGAGCGCC
GAGCAGGTTA AGCCCGTGGC GCAGAGCTTC CTCTCGGCCG AGGTGGACGC GGGGGAGGCG
CTCGGCATTG CTAGGAGGGA GATAGACTCC GAAATATACG CAGAGATAGA GAGATCGGCG
CGGAGGCAGG TGGAAGGCTC CCTGAAGGAA GTTGCGGTCG AGTCCCTCTC GGTGGACCTC
AAGGTAGTCG AGAAGAGCCT CGTCTTCGTC CCGTACTGGT TCTTCACGTA CAAAGTGGAG
GGAAACCTCT ACGCGGGGGC CGTGGCGGGA CCGAAAGCCA CCCTCCTGAA GGCCGAGCGT
CGCATCTCGA ACATCGAGAG GGCCGCGAGG CTCGCCGGAG CGTGGATAGC CGTGCTGGCC
TCGGGCGCGC TGGCGCAGGT CTCGGTGGGT AGCGACCTCG GCTTCCCGGG CGTCCTAATG
GCGTGGGCTA TAGGGTTGGT AGGAGCCTAC AAGCTCGCCG AGTCGGCGTT CGCCCCGGCG
GAGGTGGTAG CGTGA
 
Protein sequence
MGEASKVLVA ATTRDGRLVY LSAVAKPQQP GLDEALASLL KSLAYHSYEE LQGDRVLLEA 
KKALSSQGFK VEDLELSVSF RCPSCGASIN FSPETVVYVC PYCGWSGDVY GRELRVKAWP
PGGREKLEEI VRGLGGVLHD AVLRYVPFWV FKVKVEGSYA GTATYTVTRT EYVTVIHEGR
PRQIPTTRTE VRRKKVAGRV SFSTVKGVGA RVLAEVFGGE GLKRWVEYEW ENNPPPELSA
EQVKPVAQSF LSAEVDAGEA LGIARREIDS EIYAEIERSA RRQVEGSLKE VAVESLSVDL
KVVEKSLVFV PYWFFTYKVE GNLYAGAVAG PKATLLKAER RISNIERAAR LAGAWIAVLA
SGALAQVSVG SDLGFPGVLM AWAIGLVGAY KLAESAFAPA EVVA
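
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch only, not the method used to produce the record: the function name `translate_cds` is hypothetical, the codon table is the standard amino acid assignment (which translation table 11 shares), and the set of start codons translated as methionine is a simplified subset chosen for this example.

```python
from itertools import product

# Standard codon-to-amino-acid assignment in TCAG order; translation
# table 11 uses the same assignments and differs only in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified subset of alternative start codons for this sketch.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna):
    """Translate a coding sequence, ignoring whitespace between blocks."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    residues = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        # An initiating codon is read as methionine regardless of its usual meaning.
        if i == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence listed above.
head = translate_cds("GTGGGCGAAG CTTCGAAAGT CCTCGTCGCC")
tail = translate_cds("GAGGTGGTAG CGTGA")
print(head, tail)
```

The gene begins with GTG, which is read as methionine when it initiates translation, so the translated head matches the first ten residues of the protein sequence (MGEASKVLVA), and the tail, which ends at the TGA stop codon, matches its last four residues (EVVA).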