Gene Tpen_0269 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0269 
Symbol 
ID4601890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp237423 
End bp238871 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content57% 
IMG OID639773024 
Producthypothetical protein 
Protein accessionYP_919682 
Protein GI119719187 
COG category[S] Function unknown 
COG ID[COG1690] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCCGC CACTTAGGAG GATCACCGAG TACCTATGGG AAATACCGGA GAAGTACAAG 
CCCGGCATGA ACGTGCCGGG GCTCGTCATA GCGGATGAAG TGTTAATATC GAAGATGAAG
GAGGATTTAA CGCTCGAGCA GGTCGCTAAC GTTGCCATGT TGCCGGGCAT CTACAAGTAC
TCTATAGTTC TTCCAGACGG CCACCAGGGT TACGGCTTCC CAATAGGCGG CGTCGCCGCG
TTCGATGCAG AGAAAGGCGT GCTTAGCCCC GGAGGCGTAG GCTACGATAT CAACTGCGGC
GTAAGAGTCC TGTCCACGAA CCTTACGGAG CAGGAGGTTA GGCCGAAACT CAGGGAGCTC
GCAGAGACCC TTTTCAGGAA GATCCCCTCG GGGCTTGGAA GCACGAGTGG TTTAAGGCTG
AGCCACGCGG AGCTGGACCG CGTACTCGAG GAGGGGGTCG AGTGGGCTAT TGAAAGAGGC
TACGGCTGGA GAGAGGACAT GGAGCATATA GAGGAGAAGG GGAGAATGGA GGGCGCAGAT
GCAGACGCGG TGTCCAACGA GGCCAAGCAG AGGGGTAGCA ACCAGCTGGG CACGCTGGGT
AGCGGGAACC ACTTCCTGGA AGTCCAGAGG GTTGACAAAA TCTACGACCC CGAGGTTGCA
AAGGTTTTCG GGATTGAGAG GGAGGGACAA GTAACCGTAA TGATACACAC AGGTAGCAGG
GGGCTTGGAC ACCAGGTTGC AAGCGACTAT CTGAAGATAA TGGAGAGAGT CGTAAGAAAG
TACAACATGC CGCTACCGGA CAGGGAGCTT GTATCCGTGC CGGCGACGTC CCCGGAGGCC
GAAAGGTACT TCGCAGCCAT GAAGGCCGCG GCGAACTTTG CCTGGACGAA TAGGCAGGTT
ATCACACACT GGGTTAGGGA AAGCTTCCGA GCGGTTTTCA AGACAGACCC GGATAAGCTC
GGTCTCAATG TGATCTACGA CGTAGCTCAC AACATTGCCA AGCTCGAGGA GCACGTGGTG
GACGGTAAAA GAGTGAAAGT CTACGTTCAC AGGAAGGGGG CTACGCGGGC CTTTCCACCC
GGGCACCCAG AGATCCCCGC AGACTACAAA TCCATAGGTC AACCCGTCCT GATACCCGGC
TCTATGGGTA CTGCGAGCTA CATACTCGTA GGAACGCAGA AAGCCATGGA TTTGACGTTC
GGCTCCTCTC CGCACGGCGC TGGAAGGATG CAGAGCCGCG CTGAAGCGCG TAGAAGCGTG
AGGGGACAGG AGATAAAGTC CGAGCTTGAG AGTAGAGGTA TAGTGGTTAG GGCTGCGAGC
CTAGCTGTCG TCGCCGAGGA GGCTCCAGAC GCGTACAAGG ACGTGGACAG AGTAGTTATG
GTCGCGGATG CAGTCGGCAT TGCTAGGAAG ATAGTGAGGA TGACGCCCAT AGCAGTGGTG
AAAGGCTAA
 
Protein sequence
MAPPLRRITE YLWEIPEKYK PGMNVPGLVI ADEVLISKMK EDLTLEQVAN VAMLPGIYKY 
SIVLPDGHQG YGFPIGGVAA FDAEKGVLSP GGVGYDINCG VRVLSTNLTE QEVRPKLREL
AETLFRKIPS GLGSTSGLRL SHAELDRVLE EGVEWAIERG YGWREDMEHI EEKGRMEGAD
ADAVSNEAKQ RGSNQLGTLG SGNHFLEVQR VDKIYDPEVA KVFGIEREGQ VTVMIHTGSR
GLGHQVASDY LKIMERVVRK YNMPLPDREL VSVPATSPEA ERYFAAMKAA ANFAWTNRQV
ITHWVRESFR AVFKTDPDKL GLNVIYDVAH NIAKLEEHVV DGKRVKVYVH RKGATRAFPP
GHPEIPADYK SIGQPVLIPG SMGTASYILV GTQKAMDLTF GSSPHGAGRM QSRAEARRSV
RGQEIKSELE SRGIVVRAAS LAVVAEEAPD AYKDVDRVVM VADAVGIARK IVRMTPIAVV
KG