Gene Tpen_0561 details


Gene Information

Locus tag: Tpen_0561
Symbol:
ID: 4600604
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 508524
End bp: 509999
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 57%
IMG OID: 639773332
Product: hypothetical protein
Protein accession: YP_919970
Protein GI: 119719475
COG category:
COG ID:
TIGRFAM ID:
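
The coordinates and lengths listed above can be cross-checked against each other. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon; neither convention is stated in this record. The function names are illustrative and do not belong to any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_length = span_length(508524, 509999)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1476 491
```

Under these assumptions the span matches the reported gene length of 1476 bp, and 492 codons minus the stop codon give the reported protein length of 491 aa.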


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTAGAG TTAGGGGGAT TCACAGCACG GCGATAGCGG GGCTCTTGGA CGAGGCTGGT 
TTCCGCTTCG CGGACCTCAG CCAGGAGCTC CTGGCGCGTA TCCCCCAGCT GAGGGTGGAG
GAACGCGTTC TCGTCACCGT TAAGGACACA GATGACAGGA GCGGCGTAGT AGTACTGGGC
GACCGCGCTG TTGTCGAAAA AGTGGCATAC CTCCTCCGCG CGGTCATCCC GGGAGCTCTA
GTTAGCTACG TGGGCGAAGG TCCCTATACG ACGTACGCGG TGAGGCTTCT CTCGAGGGTG
GAGGGCGACG TCTACGAGGC GGAGTATTCG CCCGGGAAAC GCACGACGGT CAAGCTTCGT
CGCCCTCACG TAGAAGGCGA GGTGATAATG GCCCACGTGA TCAGGGCCTC CCCGGAGGCC
CCCTTGCTAA AGGAGGGGGT GGCGATTACG GGGAGCCTCG TGAGGCTGGT ACAGTTCGAC
AGGCACAGTG TAAGCGAGCA CATCAGGGAC GAGAACCTTA GACTGCAGCT GTTAACGCTC
GCTATGACGA GCGCGCCGAC GGGGTGGGGT GTTCACTTCA GGAGTGCGTC GAAGAGGGCG
AGCATAGTCG ACGTAATGGC GGAGATTAAG GCTCTAAGCG AGAAGGCCGA GAAAATCCTT
AAGGAAGTCG CGCCGAAAGA GCCCGGCGTA GTTGTTCCCG GTGAGGCGAT AGCGATAGTA
GAGATCCCGG CGGACGCGTC GATACGGATG GATGCGCTTC GATCGCGCTA CTACCCGACG
CTACCGCTCC ACCACCTTCT TAAGCGGCTG GGAGACGACG AGCTCAGCAG GGCTGTAGAT
TTCTCGGAGA GGCTCCTCGC AGGTTGCGAG AAGTGCCTAT CGAGTACGGG GGCAATAGAG
GTTTTCCTGG AGAGGCTCTC CTCGCTTAAA GGGCGCCAGG TAAGCGTGTT GCACAGGAAG
GTCGCGGGCG CAGGCCACGT TTGGAGCGCT GAAGTGGAAA GCGTGAAAAG AATGACTGTC
GTTTTGAAGC GTGTAGTATC GTCGCCAGGT CTATACGACG GCTTCGAAGG CTTAAAGCGA
GAACCTGGAG ATGTTATCCG CTCATATACC TGGCTTTTCG GGAGGGCGGT CGTCCACTTT
TACACTTCGG CCCGGGGAGA ATTGAAGGGG GTATACGTTA ACATAAATGC TCCGGTATTC
TTCGCCGGCA ACGCTAACAC CCTTGGCTAC GTAGACCTAG GGGTAGACGT TACCCGTGCA
GCAGATGAAG AGCCGAAAGT AGTGGATCTC GCCGAGTTCC TAGACCTCGT GGAAAGAGGG
GTTCTAGACA AACAGCTAGC AGGTAGCTAC CTAGAGTTCG CGGAGTCCGT GAAGCATCTC
TTGGAAAAGG ATATTGGGGA AGATCTCCCG GCTAGGATTA TGCAGGCACA AAAAAGCATA
TTTTCATTTG AAACGGACAA GCTATTAGCA GTGTGA
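
The GC content reported for this gene can be recomputed from the sequence above. The sketch below assumes GC content means the fraction of G and C among the unambiguous bases A, C, G and T, and that the spaces and line breaks in the listing are layout only. How the database itself computes or rounds the value is not stated here, and `gc_content` is an illustrative name.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and non-ACGT symbols."""
    bases = [base for base in sequence.upper() if base in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence: 3 G, 0 C.
print(gc_content("ATGATTAGAG"))  # 0.3
```

Passing the full gene sequence, with its spaces and line breaks left in place, gives a value that can be compared with the GC content listed in the record.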
 
Protein sequence
MIRVRGIHST AIAGLLDEAG FRFADLSQEL LARIPQLRVE ERVLVTVKDT DDRSGVVVLG 
DRAVVEKVAY LLRAVIPGAL VSYVGEGPYT TYAVRLLSRV EGDVYEAEYS PGKRTTVKLR
RPHVEGEVIM AHVIRASPEA PLLKEGVAIT GSLVRLVQFD RHSVSEHIRD ENLRLQLLTL
AMTSAPTGWG VHFRSASKRA SIVDVMAEIK ALSEKAEKIL KEVAPKEPGV VVPGEAIAIV
EIPADASIRM DALRSRYYPT LPLHHLLKRL GDDELSRAVD FSERLLAGCE KCLSSTGAIE
VFLERLSSLK GRQVSVLHRK VAGAGHVWSA EVESVKRMTV VLKRVVSSPG LYDGFEGLKR
EPGDVIRSYT WLFGRAVVHF YTSARGELKG VYVNINAPVF FAGNANTLGY VDLGVDVTRA
ADEEPKVVDL AEFLDLVERG VLDKQLAGSY LEFAESVKHL LEKDIGEDLP ARIMQAQKSI
FSFETDKLLA V
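
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table from the compact NCBI layout (bases in the order T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard code and differs in its set of permitted start codons. The sketch does not handle alternative start codons, which is enough here because this gene starts with ATG. The name `translate` and the use of `X` for unrecognised codons are illustrative choices.

```python
BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ("*" marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()  # drop layout spaces and line breaks
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X = unrecognised codon
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First line fragment of the gene sequence, as laid out in the record.
print(translate("ATGATTAGAG TTAGGGGGAT TCACAGCACG"))  # MIRVRGIHST
```

The output matches the first ten residues of the protein sequence above. The final three codons of the gene, GCA GTG TGA, translate to A, V and a stop, matching the last two residues of the protein.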