Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1318 |
Symbol | |
ID | 4601998 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 1265808 |
End bp | 1267571 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639774093 |
Product | hypothetical protein |
Protein accession | YP_920718 |
Protein GI | 119720223 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00353025 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGAGGTGG AGGAGCTCTC AGCCGTCGGC GCGAGGCCGG GGTGCTTCTT CTCGGCGGAC TTCGTGAACA TCACCCCGTG GTACGGCGGC AGGCACACCC AGGACGCTGT GAGGTGCCTG GACGAGAGGT GCACGAAAGC CTACTACTCG CTCCCTACGG CCAGGAGCGT GAAGGGGCTG CTGAGGTGGC TTACCAGAGC GGTGGTAGCG AGCTTCGTCC CCGACGACCA GCTGGCCAGC CACGGCTACG CCGCGGTCGA GTGCTTCCCG AACTGCGGGT CCAGCAAGCC GGGCCTCGTC GAGGCGATCT TCGGAACCGT GGAGCACGCG AGGCCCGGGG GGCAACGCGT AGGGTCCAGG GCGGGCGCCC TCTCGGTCGT CGTGAAGCCG AAGCTGAACT GCCGCTCACC CGTCTACGCC GAGTACCAGG ACGTGCTAAA GCTGATCAAG AGCATAGCCG GGGGTAAAGG GTGGGGCGTT TACTCGCAGA AGCCGAGCGC GCTGCTACAG GAGCTCGAGG ACCGGGGCTT CAGGGCGCTG AGGGGGGAGG CGAGGCCGGA GGACAAGGCG GCGGGGTTCG CCGAGCTGTT CACAGTCCCG CGCGTACTGC TGAACGCCCA GAGGCTCGGC AAGCTGAGGG GCAAAAGGCA GGAGGAGTTC GCGAGGAGCC TCTTCGAGGT CCAGCCGCTC AGGGAGGGGT GCGTCTCCAT GCGCGTCGAG CTCTACCTCG ACGGGGACAT GCTCTCCGGG GCCCTGGAGC CCGCGCGGGG GGAGGAGCTC GCGGAGAGCG TGAAGCGGCT CGAGGAGCTC CTACTGGTCT ACGGGCTACT GCTCTTCGGA ATCGGGAAGG CTTCGAGCAG GGGCTTCGGC AGGTTCGCCC CCAAGTCGCC GAGGGGCAAC GTGCACCCGC TGGTCGAGAA GGCCGCCGCG CGCCTCGAGG AGAGAGACCT CGAGGGCTTC AGGGAGGAGT GCCTGGGGCT GGCTGAGCGG GCGCTGAGGG CCCTCGGCGT CGAGGCGGAG GCGAGGAGGA CGGTGGCGAG CGTCCCGAGG ATCTCGAACG CCGAGGTAAC GCTGATCGAG AGGCCGGCGC ACCCGTACCC GTACGCATCC AGGGAGGCCG CGAGGGTTAA GCCCTCCAAG AAGCCATGCT CGGGGGACGT GCTGTGCGTG CTGAGCGCCG TAGGCAAGGC CACGCTGAAG AGCACGTGGA AGGCCTACTG GCAGTCGATC TCCGGGGCTG AATGGGGCGT GACGGGCCCG GGCTTCCCGT TCCACACGTG GGGGCTGGGC CTGCCGAGAG CCGTGTGCAA GGGAAACTCC TGCACGGGCT ACGTCGTCGT CGACGCGGAG AGCCTTGGAG GCGCGCAGGG AGACGTGGAC TACTGCTTGC AACGCCTCAG CTTTAGGAAC GACCTGAAGA GGTGGAAGTC GCCGCTCGTG CTGTCCCCCG TCCCCGCGGG CAACGGGCTC GGGGTAGCCG TCGTCCTGCT CAAGCGCCTA GACATCAAGC CGTTCCTGAG CCCCGAGGCG CGGAGAGCCG TCCTCGCCCA CGTCGGTATA CACCAGGGTA GCAGATACTT GCACGTGATC GACGTTGGGA GGGGCGCGTC GACGACCGGC TGGACCGAGG ACTGCGGGTC GGACCCCCTC GGCGCCGCCG ACGTCTCTCA GAGGAGGGTC GTCGCGCTCC CCGGGGACGC GGCGGAGCTC CTCGTGAAAG TCCAGGAGCT CGCGAGGGAC TGGGTTGTGT ACCTGCTGAG GTGA
|
Protein sequence | MEVEELSAVG ARPGCFFSAD FVNITPWYGG RHTQDAVRCL DERCTKAYYS LPTARSVKGL LRWLTRAVVA SFVPDDQLAS HGYAAVECFP NCGSSKPGLV EAIFGTVEHA RPGGQRVGSR AGALSVVVKP KLNCRSPVYA EYQDVLKLIK SIAGGKGWGV YSQKPSALLQ ELEDRGFRAL RGEARPEDKA AGFAELFTVP RVLLNAQRLG KLRGKRQEEF ARSLFEVQPL REGCVSMRVE LYLDGDMLSG ALEPARGEEL AESVKRLEEL LLVYGLLLFG IGKASSRGFG RFAPKSPRGN VHPLVEKAAA RLEERDLEGF REECLGLAER ALRALGVEAE ARRTVASVPR ISNAEVTLIE RPAHPYPYAS REAARVKPSK KPCSGDVLCV LSAVGKATLK STWKAYWQSI SGAEWGVTGP GFPFHTWGLG LPRAVCKGNS CTGYVVVDAE SLGGAQGDVD YCLQRLSFRN DLKRWKSPLV LSPVPAGNGL GVAVVLLKRL DIKPFLSPEA RRAVLAHVGI HQGSRYLHVI DVGRGASTTG WTEDCGSDPL GAADVSQRRV VALPGDAAEL LVKVQELARD WVVYLLR
|
| |