Gene Tpen_1685 details


Gene Information

Locus tag: Tpen_1685
Symbol:
ID: 4600573
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 1634106
End bp: 1635266
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 52%
IMG OID: 639774458
Product: hypothetical protein
Protein accession: YP_921083
Protein GI: 119720588
COG category:
COG ID:
TIGRFAM ID:
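
The listed lengths can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon excluded from the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1634106
end_bp = 1635266
gene_length_bp = 1161
protein_length_aa = 386

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A coding sequence should divide evenly into 3-base codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein length.
expected_protein_aa = codons - 1

print(span_bp, codons, remainder, expected_protein_aa)  # 1161 387 0 386
```

Under these assumptions the coordinates, the gene length, and the protein length agree with one another.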


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000183285
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGGAGTTA GGGCCGAGTT TCTGAGGCTA CTGGAGGAGG ACGTGGAGTT TAGGTATGCA 
GTCGCAGGCT ACCTGGGAGT GTTAGAGGTT TTGAGGAGGC TAGACTCCCT AGCCGAGGAG
ATGAAGAGGC TCAGAGAAGA AATGCACGCG GGCTTCGCGA AGCACGGGGA AATACTTGAA
AGGCTCGAAA AGATCCTCGA GAAGCACGAA GAGGCCTTGA GAAGACACGA TGAGGAGCTA
ATCAGGCTTA GAGAGGACAT GAATAAGGGG TTTGCCAGGC ACGACGAGCA ACTCGCAAAG
CTTAGGGAAG ACATGAACTC CGGCTTCGCG AGGCACGACG AAGCATCGAG AAGGCACGAG
GAGCAACTTG TGAAGCTTAG AGAGGACTTC AACAAGCTGA GGGAGGATAT GAACAAGGGC
TTCGCTAGAC ACGACGAGGC TCTTAAAAGG CATGAAGAAG AACTCGTGAA GCTTAGGGAG
GACATGAACG CCGGCTTTGC TAGACAAGAC AAAGAGCTCA CCAGGCTTAG AGAGGACATG
AACGCCGGTT TCGCTAGGCA TGACGAGTTA TTCAGGAGGC ATGAAGAGGA GATGAAGGGA
CTACGGGAGG ATATGAATAA GGGATTTGCC AGGCACGATG AAGCGTTGAG GAGGCACGAG
GAAGAGCTAG TAAAGCTCAG AGAGGACATG AACAAAGGCT TTGCTAGACA CGACGAGGCG
CTGAGGAGGC ATGAGGAGAT TTTAGAGAAG CATAGCGAGG AGTTAGCAAA GTTGAGGAGC
GCTATGATTG CCGGTTTTGG GGAGTTAAGC AAGTTTGCGG GTATGACCTT CGAGGAATTT
GTTAGGAAGT TCCTGACCGC TTACCTCAGG GAGGCGGGCG AAGTCCCGGA AGGCTCAGAG
CTGAGGAGGG AGGTCGTAGA AGGCGAGGAG ATAGACCTCT TCCTCGAAGA ACCACTCATA
GTCGGAGAAG TAACGGCACA CGCAGAGTCC CTGGAGGAGC TTGAGAAGCT AGTCAGGAAG
GCGGAGCTGG TGAAAGCCAA GTACGGTAAA GAACCCAGGA AAATCCTAGT CATACTTACA
GCGCCCAGGG ACTTAGCAGA GAAACTTGAA AAAATTGCCG GCGAGAAGGG CGTAGAGCTC
ATAATAGGGA GAACAGCCTA G
 
Protein sequence
MGVRAEFLRL LEEDVEFRYA VAGYLGVLEV LRRLDSLAEE MKRLREEMHA GFAKHGEILE 
RLEKILEKHE EALRRHDEEL IRLREDMNKG FARHDEQLAK LREDMNSGFA RHDEASRRHE
EQLVKLREDF NKLREDMNKG FARHDEALKR HEEELVKLRE DMNAGFARQD KELTRLREDM
NAGFARHDEL FRRHEEEMKG LREDMNKGFA RHDEALRRHE EELVKLREDM NKGFARHDEA
LRRHEEILEK HSEELAKLRS AMIAGFGELS KFAGMTFEEF VRKFLTAYLR EAGEVPEGSE
LRREVVEGEE IDLFLEEPLI VGEVTAHAES LEELEKLVRK AELVKAKYGK EPRKILVILT
APRDLAEKLE KIAGEKGVEL IIGRTA
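
The sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any length or composition check. The following is a minimal sketch of that step; `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first line of the gene sequence is used as input for illustration.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence as listed above (illustration only;
# paste all lines of the gene sequence to check the full record).
first_line = "TTGGGAGTTA GGGCCGAGTT TCTGAGGCTA CTGGAGGAGG ACGTGGAGTT TAGGTATGCA"

seq = join_blocks(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of this one line only
```

Applied to the complete gene sequence, the joined length should match the listed gene length, and the GC fraction can be compared with the listed GC content.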