Gene Tpen_1703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1703 
Symbol 
ID4601665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1643713 
End bp1645086 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content62% 
IMG OID639774476 
Producthypothetical protein 
Protein accessionYP_921101 
Protein GI119720606 
COG category[S] Function unknown 
COG ID[COG4938] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0867999 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAATGC CTGTTAATGC GCGTGTATAC GTGAGGGACT TCGGTCCTTT CGAGGAGGCG 
AGTATAGAGG TTAGGCCGCT GACGGTTCTC GTGGGCAGGA ACAGTGTGGG TAAGTCGATG
CTATTGCAGC TAGTGTGGGC GCTGACAGTA GCGATGCCCG ACTTAAAGTT GCTCGGCGAG
GCTGTCTCAG AGCTCGGGGC AGGGGAGCTC GTAGGCGAGG TTTTGGAGGG TGTCGAGAAA
GGGTCTGCGT CCCGGGATAG CTTCAAGAGG CTCCTCAAGC TATACCTGGA GGCGCTTCCC
GGGGCCCTCG CTACGGGCCT CGGCAGGACG CTCGAGGAAG TGTTCGGCGC TGGTCTGCAG
GAGCTCGTGC GGAGCGGCTC GGGGCGGGCG GTTGTGGGTG TTGCGGGCTC GTTTTCCTCC
ATCGAGTTCG TGATTGAGGG TGGGCGCCTC GCAGTGAGCC GGCACAAACC CTACGAGGGC
TTCCTGGACG AGCTAGAGGT CACTGTGCCG GCGCCTGGGA GGCTTAGGAT TATTCATAGG
CTCTCCGGCG TCAGTCTGTA CGACGAAGCG GTTTTGAGTC CGTCCGACTT GGTTGACGCG
GTGCTCAAGG TGTTAGCGAT TTATTTGTAC AAGGCTCTCG ACATCTTCTT CGAGGCTCCC
GGCGTTTCCG TGCTTCTGCC TGACAGCAGG GCTGGGCTTT CAAGGATCCT CCTTAAGCCC
TACGCTAGGC CTAGCTTGCT TAAGGACGTG TTGTACCCCG ACGAGCACTT CAGGGACGCC
TACTTCATGC TGGCCGAGAG CCTGGCCGAG GGGAAGGTCG ACACGGGAGA CTTGGAGGAC
TTTCTGAGAG AGCTCGGTTG TAGCGTGGAG GCAATCCCGG AGGGCGGGGT GCGCGCAGTA
TACGTCAACA CGTGGAGTGG CCAGAGGCTT CCCCTGCCCC GCGCCCCCTC GGGTGTGCGC
GAGTCGCTAG CTGTAGCGCT GGCCCTCGTG GTTCCAGAGC AACCATGGCT AGTGTTTATC
GAGGAGCCCG AGGCCCACCT GCATCCTCGG GCGCAGAAGG CTTTAGCGAG GCTTATCGCT
AGGGCTGTCA AGAAGCACGG GAAGGTGGTG GTCCTCTCGA CGCACAGCGA TTACCTGCTC
TACGCGGTTA GCAACATGGT GGCGTTGTCC TCGTCGCCGG GTGTGGCGGA GAGGCTGGGG
TACAGCGCGG CCGAGGTTTT GGATCCAGGG CTCGTGGCGG CCTACTTGCT CAGGGCTGAG
GGGAGGAGGG CTGTTGTCGA GAGGTTGGAT GTGGGGCCGG AGGGTGTGCC TGAGGAGGAG
TTCGTGAGGG TCGCCGAGGA GCTGGCGGAG GAGAGGGCGA GGATCCTGGC TTAG
 
Protein sequence
MGMPVNARVY VRDFGPFEEA SIEVRPLTVL VGRNSVGKSM LLQLVWALTV AMPDLKLLGE 
AVSELGAGEL VGEVLEGVEK GSASRDSFKR LLKLYLEALP GALATGLGRT LEEVFGAGLQ
ELVRSGSGRA VVGVAGSFSS IEFVIEGGRL AVSRHKPYEG FLDELEVTVP APGRLRIIHR
LSGVSLYDEA VLSPSDLVDA VLKVLAIYLY KALDIFFEAP GVSVLLPDSR AGLSRILLKP
YARPSLLKDV LYPDEHFRDA YFMLAESLAE GKVDTGDLED FLRELGCSVE AIPEGGVRAV
YVNTWSGQRL PLPRAPSGVR ESLAVALALV VPEQPWLVFI EEPEAHLHPR AQKALARLIA
RAVKKHGKVV VLSTHSDYLL YAVSNMVALS SSPGVAERLG YSAAEVLDPG LVAAYLLRAE
GRRAVVERLD VGPEGVPEEE FVRVAEELAE ERARILA