Gene Tpen_1318 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1318 
Symbol 
ID4601998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1265808 
End bp1267571 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content68% 
IMG OID639774093 
Producthypothetical protein 
Protein accessionYP_920718 
Protein GI119720223 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00353025 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGAGGTGG AGGAGCTCTC AGCCGTCGGC GCGAGGCCGG GGTGCTTCTT CTCGGCGGAC 
TTCGTGAACA TCACCCCGTG GTACGGCGGC AGGCACACCC AGGACGCTGT GAGGTGCCTG
GACGAGAGGT GCACGAAAGC CTACTACTCG CTCCCTACGG CCAGGAGCGT GAAGGGGCTG
CTGAGGTGGC TTACCAGAGC GGTGGTAGCG AGCTTCGTCC CCGACGACCA GCTGGCCAGC
CACGGCTACG CCGCGGTCGA GTGCTTCCCG AACTGCGGGT CCAGCAAGCC GGGCCTCGTC
GAGGCGATCT TCGGAACCGT GGAGCACGCG AGGCCCGGGG GGCAACGCGT AGGGTCCAGG
GCGGGCGCCC TCTCGGTCGT CGTGAAGCCG AAGCTGAACT GCCGCTCACC CGTCTACGCC
GAGTACCAGG ACGTGCTAAA GCTGATCAAG AGCATAGCCG GGGGTAAAGG GTGGGGCGTT
TACTCGCAGA AGCCGAGCGC GCTGCTACAG GAGCTCGAGG ACCGGGGCTT CAGGGCGCTG
AGGGGGGAGG CGAGGCCGGA GGACAAGGCG GCGGGGTTCG CCGAGCTGTT CACAGTCCCG
CGCGTACTGC TGAACGCCCA GAGGCTCGGC AAGCTGAGGG GCAAAAGGCA GGAGGAGTTC
GCGAGGAGCC TCTTCGAGGT CCAGCCGCTC AGGGAGGGGT GCGTCTCCAT GCGCGTCGAG
CTCTACCTCG ACGGGGACAT GCTCTCCGGG GCCCTGGAGC CCGCGCGGGG GGAGGAGCTC
GCGGAGAGCG TGAAGCGGCT CGAGGAGCTC CTACTGGTCT ACGGGCTACT GCTCTTCGGA
ATCGGGAAGG CTTCGAGCAG GGGCTTCGGC AGGTTCGCCC CCAAGTCGCC GAGGGGCAAC
GTGCACCCGC TGGTCGAGAA GGCCGCCGCG CGCCTCGAGG AGAGAGACCT CGAGGGCTTC
AGGGAGGAGT GCCTGGGGCT GGCTGAGCGG GCGCTGAGGG CCCTCGGCGT CGAGGCGGAG
GCGAGGAGGA CGGTGGCGAG CGTCCCGAGG ATCTCGAACG CCGAGGTAAC GCTGATCGAG
AGGCCGGCGC ACCCGTACCC GTACGCATCC AGGGAGGCCG CGAGGGTTAA GCCCTCCAAG
AAGCCATGCT CGGGGGACGT GCTGTGCGTG CTGAGCGCCG TAGGCAAGGC CACGCTGAAG
AGCACGTGGA AGGCCTACTG GCAGTCGATC TCCGGGGCTG AATGGGGCGT GACGGGCCCG
GGCTTCCCGT TCCACACGTG GGGGCTGGGC CTGCCGAGAG CCGTGTGCAA GGGAAACTCC
TGCACGGGCT ACGTCGTCGT CGACGCGGAG AGCCTTGGAG GCGCGCAGGG AGACGTGGAC
TACTGCTTGC AACGCCTCAG CTTTAGGAAC GACCTGAAGA GGTGGAAGTC GCCGCTCGTG
CTGTCCCCCG TCCCCGCGGG CAACGGGCTC GGGGTAGCCG TCGTCCTGCT CAAGCGCCTA
GACATCAAGC CGTTCCTGAG CCCCGAGGCG CGGAGAGCCG TCCTCGCCCA CGTCGGTATA
CACCAGGGTA GCAGATACTT GCACGTGATC GACGTTGGGA GGGGCGCGTC GACGACCGGC
TGGACCGAGG ACTGCGGGTC GGACCCCCTC GGCGCCGCCG ACGTCTCTCA GAGGAGGGTC
GTCGCGCTCC CCGGGGACGC GGCGGAGCTC CTCGTGAAAG TCCAGGAGCT CGCGAGGGAC
TGGGTTGTGT ACCTGCTGAG GTGA
 
Protein sequence
MEVEELSAVG ARPGCFFSAD FVNITPWYGG RHTQDAVRCL DERCTKAYYS LPTARSVKGL 
LRWLTRAVVA SFVPDDQLAS HGYAAVECFP NCGSSKPGLV EAIFGTVEHA RPGGQRVGSR
AGALSVVVKP KLNCRSPVYA EYQDVLKLIK SIAGGKGWGV YSQKPSALLQ ELEDRGFRAL
RGEARPEDKA AGFAELFTVP RVLLNAQRLG KLRGKRQEEF ARSLFEVQPL REGCVSMRVE
LYLDGDMLSG ALEPARGEEL AESVKRLEEL LLVYGLLLFG IGKASSRGFG RFAPKSPRGN
VHPLVEKAAA RLEERDLEGF REECLGLAER ALRALGVEAE ARRTVASVPR ISNAEVTLIE
RPAHPYPYAS REAARVKPSK KPCSGDVLCV LSAVGKATLK STWKAYWQSI SGAEWGVTGP
GFPFHTWGLG LPRAVCKGNS CTGYVVVDAE SLGGAQGDVD YCLQRLSFRN DLKRWKSPLV
LSPVPAGNGL GVAVVLLKRL DIKPFLSPEA RRAVLAHVGI HQGSRYLHVI DVGRGASTTG
WTEDCGSDPL GAADVSQRRV VALPGDAAEL LVKVQELARD WVVYLLR