Gene Tpen_1572 details

Gene Information

Locus tag: Tpen_1572
Symbol:
ID: 4600558
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1521708
End bp: 1523303
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 62%
IMG OID: 639774345
Product: hypothetical protein
Protein accession: YP_920970
Protein GI: 119720475
COG category: [R] General function prediction only
COG ID: [COG0433] Predicted ATPase
TIGRFAM ID:
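
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the CDS is a stop codon that does not appear in the protein. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(1521708, 1523303)
print(length)                     # 1596
print(protein_length_aa(length))  # 531
```

Both results agree with the Gene Length (1596 bp) and Protein Length (531 aa) fields of this record.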


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.127794
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGTATTCTT CCCTTAGGCA GGTCGGCAGG ATAGTCGAGG AGGCTACTCC GGAGAGCTTC 
ATCTTCGTCA CGACGAAGCA GGACCACCCG CCGAAGTACG AGTACGTGCT GGTTAAGTCG
AGGGAGGTGG TCGCCGGCGA GGAGAGGGAG GTGGACGTCC TCTGCCAGGT GACCGGCGTG
GTTTCCAGGA GCGACGCGTA CAGCAGTAGG CTAGACCTGG AGAGCCTAGA GCGGATACAC
GAGGCTGGGA TAGACGACGC CAACCTTCTC TGCGCGGCGC GGACCCTCGG CTACCTCGCC
GAGGAGGACG GGAGGAAGGT GGTCCTGATG CCCAGGAGGG CTTTCTTCCC GGGAAACCCG
GTCTACCTGG CCCCGGACGA CTTCGTCAGG GAGTTTTTCT CGTACCCCGG GGAGGAAGGT
ATACGCATAG GTAGCCTCGT CTCCCGCAGG AACGTCGACG TCTACCTCTC GGTGAACGGG
TTCAGGAGGC ACGTCGCCGT TATAGCCCAG ACCGGCGCGG GCAAATCGTA CACCGTCGGC
GTCATACTGG AGGAGTTGCT GAGGCTGGGA GCCACGGCGG TAGTCATAGA CCCGCACGCG
GACTACGTGT TCCTGAGCAG GGACAGGGAC ATGAGGCGCC ACGAGTACTC GGACAGGGTC
CTCGTCTTCA GGAACCCGAA CAGCACTGGG CGCTACGACC CGAGCCAGAT GGACAACGTG
CACGAGCTGA CGGTGAAGTT CTCCGATCTC TCGGCGGAGG ACGTGGCGAG GATAGCCGGG
ATTCCGGAGA AGTGGACGAA CGTAAGGAAG GCTATCCGCG ACGCCCTCGA CAAGCTCAGG
GGGAGGGACT ACACGATTGA CGACCTGCTC GGCGAGCTGG AGAAGATGTC GCGGGGAGGC
GGCAAGGAGG CTGCCTACGC GTCGAGCGCG TACAACCACC TCGTGAAGCT GAGGAAGTTC
AGCGTGTTCG GCAGGTACAC CACGTCCGTC AAGGACGAGA TCCTGAAGCC GGGGCACGTG
AGCGTGCTGG ATCTGTCCGG GCTTAACGAC GCGAGCCAGG ACTACATAGT GAGCACCGTG
CTAGAGGAGA TATACAGGCT GAGGTACTCG GGGGAGTTCA GGTACCCCGT CTTCGTGGTG
GTAGAGGAGG CTCATAGGTT CGTCCCATCC AAGGCCTCGA AGAGGTCCAC CATGTCCTCC
GAGATCATAA ACACGATAGC GGCGGAGGGC AGGAAGTTCG GCGTGTTCCT GATCCTTGTG
ACCCAGAGGC CGAGCAAGAT AGACGCGGAC GCCCTGAGCC AGTGCAACAG CCAGATAATA
CTGAGGATCA CTAACCCGAG CGACCAGAGG GCTGTCGCCG AGGCTAGCGA GAGGCTGGGG
GAGGACCTCA TGAGGGACCT ACCTGGGCTC AACGTCGGCG AGGCCATAAT AGTTGGGGAG
CTGACCCGGG TACCCGTCGT GGTGAAGGTG AGGCGCAGGT TTACGCGGGA GGGGGGCGCC
GACATAGACC TCGTCGCAGA GCTCCGGAGA GCCCGCGAGG AGCTCAGCCT CGCGTCAAGG
ACATATAGGG GCGAGGACCT ACTATCCGAG GTGTAG
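
The GC content field of this record can be recomputed from the gene sequence. The following is a minimal sketch that assumes GC content is defined as the share of G and C bases among all bases, expressed as a percentage. The record does not state how its 62% figure was derived or rounded, and the function name `gc_content` is illustrative.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence (spaces and line breaks are fine).
first_row = "GTGTATTCTT CCCTTAGGCA GGTCGGCAGG ATAGTCGAGG AGGCTACTCC GGAGAGCTTC"
print(round(gc_content(first_row), 1))
```

Only the first row is used here to keep the example short. Applying the function to the whole sequence would be the way to compare against the reported value.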
 
Protein sequence
MYSSLRQVGR IVEEATPESF IFVTTKQDHP PKYEYVLVKS REVVAGEERE VDVLCQVTGV 
VSRSDAYSSR LDLESLERIH EAGIDDANLL CAARTLGYLA EEDGRKVVLM PRRAFFPGNP
VYLAPDDFVR EFFSYPGEEG IRIGSLVSRR NVDVYLSVNG FRRHVAVIAQ TGAGKSYTVG
VILEELLRLG ATAVVIDPHA DYVFLSRDRD MRRHEYSDRV LVFRNPNSTG RYDPSQMDNV
HELTVKFSDL SAEDVARIAG IPEKWTNVRK AIRDALDKLR GRDYTIDDLL GELEKMSRGG
GKEAAYASSA YNHLVKLRKF SVFGRYTTSV KDEILKPGHV SVLDLSGLND ASQDYIVSTV
LEEIYRLRYS GEFRYPVFVV VEEAHRFVPS KASKRSTMSS EIINTIAAEG RKFGVFLILV
TQRPSKIDAD ALSQCNSQII LRITNPSDQR AVAEASERLG EDLMRDLPGL NVGEAIIVGE
LTRVPVVVKV RRRFTREGGA DIDLVAELRR AREELSLASR TYRGEDLLSE V