Gene Tpen_0783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_0783 
Symbol 
ID4601131 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp733469 
End bp734596 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content60% 
IMG OID639773559 
Productmonooxygenase, FAD-binding 
Protein accessionYP_920188 
Protein GI119719693 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGACGTAC TCGTACTGGG CGGGGGCCCT GCCGGTTTGC AGCTGGCCAG GTTCCTCAAG 
GGCTACGGCG ACGGCGATGT ATACGTCTAC GAGGAGCACG AAAGGGTCGG GCTACCCCAG
CACTGCACCG GGCTTGTCAG CATAGAGGGA CTTAGGAAGT GGATAGGTGT CGGCGAGCGG
GGGCTCGTCC TAAACAGGTT TAGGGGCGCG CGCTTCGTCT CCCCCTCCGG GAAAGTTTTC
CTGGCAAGGC GGGGCTCCGA GGTTGCTGCT ATAATCGAGA GGAAGCTTTT AGAGGAAAAG
CTTTACGAGG AGGCAGTATC GGCCGGTGCC CGGGTACTTC TAGGCGCGCG GCAAACGCTG
GGAGGCTTCG CGCTTTCCGT GAGGAAGAGG GGGGCGATAG GGGTTATCGC CGGCGGTACG
GGCTTCTTGT CGGTACTGCA CGGTGAGAAA AGAGAGTTCC TGCTACCGGC GCTGCAGCTC
GATGTACGCG TGGAGGACGA GGTTGGGGAT ACAGACCATC TCTACGTGTT TCTGGGCGAG
AAGTTCTCCC GAGGGCTATT CGCGTGGGCA ATCCCGCTCG AGGACGGAAC CTACAGGGTA
GGCCTTGCTA GTAGGGGCAA CGTCCTCCTA AGGCTGAAGT ACTTGATGCT CGCGCTGTCG
CGTGTAGGCG TGAAGGTGGT GAAAAGGCTG AGAGTGTTCG GCGGCGCCGT CTATACGGGC
GGAATGGTCG ACGTCTACGC CGGGGACAGG CTTTTCTTAC TGGGAGACTC GGCTGGGCAG
ACGAAGCCCA CGACTGGGGG TGGTCTCGTG TACCTTTCGA TCGCGGCGCG TGCACTGTCG
GATGCGATAC TGAGTGATAG ACCGGAGGCT TACGGAGAGG CTGTTAAGCG GGCCTTGGGC
AGGGAGATGC ACGTACAGTT GCTCGTTAGG AAAGCCTTGA ACTCTCTTTC GGACGCAGAG
CTGGACGAGC TCTTCCAGGC GCTGAAAGAA GTGGGAGGAG AAGAGATCGT GGCCAGCGAG
GGCTCCATGG ACGTTCAATC GGCGGTGGCG CTGAAGCTCT CCGCGAAACT TTTCCTCTCG
CGCCCAACCC TCCTCGCAAG CGCAGCTCTA AAGTCGCTGG CCTTCTAG
 
Protein sequence
MDVLVLGGGP AGLQLARFLK GYGDGDVYVY EEHERVGLPQ HCTGLVSIEG LRKWIGVGER 
GLVLNRFRGA RFVSPSGKVF LARRGSEVAA IIERKLLEEK LYEEAVSAGA RVLLGARQTL
GGFALSVRKR GAIGVIAGGT GFLSVLHGEK REFLLPALQL DVRVEDEVGD TDHLYVFLGE
KFSRGLFAWA IPLEDGTYRV GLASRGNVLL RLKYLMLALS RVGVKVVKRL RVFGGAVYTG
GMVDVYAGDR LFLLGDSAGQ TKPTTGGGLV YLSIAARALS DAILSDRPEA YGEAVKRALG
REMHVQLLVR KALNSLSDAE LDELFQALKE VGGEEIVASE GSMDVQSAVA LKLSAKLFLS
RPTLLASAAL KSLAF