Gene Tpen_1228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1228 
Symbol 
ID4601725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1166174 
End bp1167721 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content61% 
IMG OID639774004 
Productradical SAM domain-containing protein 
Protein accessionYP_920629 
Protein GI119720134 
COG category[C] Energy production and conversion 
COG ID[COG1032] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.641175 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTTTTCG AGGTAGTTAT AACGTCCGAT AGGACAATGA TAACGGACCA CCACGGCAAG 
GAGTTCATAG GCTTCATGGC CACTGGGCCC GCTATAGGGG TCCCGGAGAG GCTCTGGATG
TGGGCGTGCT GCCCGAAGCC CAAGGTCGAC AGGCTCGGCA GGCCGCGCGT AGCGCCCTAC
GGGCTCAGGA AGATAGAGGC GAAGCTCCAG GAGGCTGGCT TCAACGCCGC CATCGTCGAC
CCAGACCACT TGGACAAGCA CTTGGACACT ATGAAGGTGC TCCTAGTGGG GCACCACGAC
TACTTTGCTT ACGGCCCGCC GAGCAGCGAG TGGTGGGTTA TCACGGGCAG GGAGCCTGTT
AACAGGAGGA GCTTCAGGAG GCTCATGGAG TCCCCCGCTG TGCGCAAGGC GAAGGAGAAA
GGCGTGAAGA TAATCGCCGG GGGGCCCGCG GCGTGGCAGT GGCTCTGGGA GCTGGAGAGC
TGGAAGAAGT GGGGCGTGGA CACCGTCGTC GACGGGGAGG GTGAGGGGGT CGTCGTGGAC
CTAGTCGAGA AGGTTTACAG GGGGGAGCCG CTCCCAGAGT ACGTCTACGT GAGCCCCCGC
GACGCTCCAA GCATAGAGGA GATACCGGTG ATCAGGGGCG CCAGCGTCAA CGGGCTGGTA
GAGATAATGA GGGGTTGCCC CAGGGGCTGC AGGTTCTGCT CCGTGACGCT GAGACCCCTG
AGGTTCATGC CCATAGAGAA GGTTGTGGCG GAGGTCAGGG TCAACGTGAG GGCTGGGCTG
AGGAACGTCC TGCTACACAG CGAGGACGTC CTGCTCTACG GCGCCGACGG CGTAAAGCCG
AGGCCCGAGC CCGTCCTAAA GCTCCACGCC GAGGTGCTCA AAGAGGCACC CGGCAGCGTC
GCGTGGTCCC ACGCAAGCCT ATCCGCCGTG AAGTACGCCG AGGATAACTA CAGGCTGGTA
TCGCGATTAA TGGAGATGCT GAGCGAGAGG CAGGAGATAC TTGGGGTGGA GGTCGGGATA
GAGACGGGTA GCGCGAGATT GGCGAGGGAG GTCATGCCGG CGAAGGCGCT ACCCTACAGG
GCGGAGGAGT GGGTGGAGGT CGTGAAGGAC GCCTTCGCGA TAATGCACGA CAACAGGGTT
GTCCCAGCGG CTACGCTTAT ACTCGGGCTA CCCGGCGAGA CCCCGGACGA CGTCGTCAAG
ACCGCGGAGC TCGTCGACGA CTTGAAGCCC TACAGGAGCC TCATAGTGCC CATGCTCTTC
GTCCCCATGG GGAAGCTGAA GAACATGGAG AAGTTCAGGA GGGAGATGAT AACCAGGGAG
CACGTAGAGG TCATGAAGGC TTGCTTGAGG CACGACCTCT ACTGGGCCCG GGAGATAATG
GGCAAGTTCT ACCTCAAGGG GGCGCACATG GCGCCTTTAA GGTTCTTCCT CGAGGCCTTC
ATATCCTACG TTGAGCGTAG AGCGTCGAGG ATTGACGAGG AGATCAAGCA ACTCTTCGAA
GAAAAGCAAG CCCTAGAGAG GCGGCGGGAA AGCGTCGTCC GCGCCTAG
 
Protein sequence
MVFEVVITSD RTMITDHHGK EFIGFMATGP AIGVPERLWM WACCPKPKVD RLGRPRVAPY 
GLRKIEAKLQ EAGFNAAIVD PDHLDKHLDT MKVLLVGHHD YFAYGPPSSE WWVITGREPV
NRRSFRRLME SPAVRKAKEK GVKIIAGGPA AWQWLWELES WKKWGVDTVV DGEGEGVVVD
LVEKVYRGEP LPEYVYVSPR DAPSIEEIPV IRGASVNGLV EIMRGCPRGC RFCSVTLRPL
RFMPIEKVVA EVRVNVRAGL RNVLLHSEDV LLYGADGVKP RPEPVLKLHA EVLKEAPGSV
AWSHASLSAV KYAEDNYRLV SRLMEMLSER QEILGVEVGI ETGSARLARE VMPAKALPYR
AEEWVEVVKD AFAIMHDNRV VPAATLILGL PGETPDDVVK TAELVDDLKP YRSLIVPMLF
VPMGKLKNME KFRREMITRE HVEVMKACLR HDLYWAREIM GKFYLKGAHM APLRFFLEAF
ISYVERRASR IDEEIKQLFE EKQALERRRE SVVRA