Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_0071 |
Symbol | |
ID | 4601779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | - |
Start bp | 54750 |
End bp | 56405 |
Gene Length | 1656 bp |
Protein Length | 551 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 639772825 |
Product | radical SAM domain-containing protein |
Protein accession | YP_919484 |
Protein GI | 119718989 |
COG category | [C] Energy production and conversion |
COG ID | [COG1032] Fe-S oxidoreductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCGCG TAGACGTTGC CTTGATTCAC GCTCCAAGCG TGTACGACTT CCGCGAGCGC CCCTACGTCC ACTACGGACC CATAAGCGAC GTGATACCCT CTAAGCCTGT CTTCGACATG TACCCCGCCG GCTTCTTCTC CCTGGCAAGC TACCTGGAGG AAAGGGGGGT TAAAACCGGG ATATTCAACT TAGCGGCTAA AATGGTGAAC GACCCCCGCT TCGACGTTCC ACGCTTCCTC AGATCCCTCG AGGCAAGCGT GTACGGGATA GACCTGCACT GGCTGGTGCA CGCCCACGGA GCCCTCGAGA TTGCGAGGCT AGTCAAGGAG CTGAGGAAGG GGCACGTCGT GCTGGGAGGC TTCTCAGCGA CCTATTACTG GAGGGAAATC CTGGAGAAGT TCCCGTATGT CGACGCAATA GTCCTCGGGG ACACCACGGA GCCAGTCTTC TTCGAAGTAG TGCAAGCGCT GGAGGCGGGG CGCCTCGATA AGCTCGGGGA GGTGCCTAAC TTAGCCTACA GAGACGAGAA CGGCAGAGTG AGGTTCAACG GTCTAAGGTA CGTGCCGGTA GAGCTTGACG AGCTCAGACC AAAGTACGAC ATCGTCGTAA AGGTGATGGT GAGGAGCGGT ATAACGTACT CCATACCTTG GAGCACTTTC CTCAAGCACC CTGTAACCGC GGTTATAACG TACAAGGGTT GCACGTTCAA CTGCCTAGCC TGCGGAGGAA GCAGGTTCAC GTACAACGTG ATCTACGGGA GGAGGAAGCT GGGCGTCAAG AAGCCGGAAA CGCTTTTCGA AGAGTACAAG GAGATAACCG AGAGGCTGAA GGCTCCTATA TTCTTCGTCA ACGACCTCCA AGTATTAGGG AAAAGCTACG TAGAGCGACT AGTAAGCCTC CTGAGAAGTG AGAGGGCAGG CGTAGAGGTA TTCTTCGAGT TCTTCACGCC GCCTCCAAGG GACTTCCTCG CAGTACTGAG AAGCGCCGAG GAAAGGGTTT ACCTCCAGAT CTCGCCCGAG ACGCACGACG AGAGTATCCG GTCAACGTAC GGGAGGCCCT ACACGAACAG CTCGCTGAAG GCTTTCCTAA GAAACGCGGA GGATCTAGGC TTCACGAGGG TAGACCTCTA CTTCATGGTA GGGCTACCGG GGCAAACCCC TGAAAACGTT AAGGGTATAG GTAGCTTCTT CGAAGAACTC AGACGCATTG CCCCAAAAGT CGTAGACGCC TTCGTAGCGC CACTAGCGCC CTTCGTTGAC CCGGGAAGCC CGGCCTTCCA CATGTCCGGC AAGTACGGGT ACCGCTTATT CGCGTATACA CTCTCGGACC ACAGGAAGCT CCTACTCGCG GATAAGTGGT ACCTAATGCT CAACTACGAG ACTAGGTGGA TGACGCGGGC AGAGATAGCG GCGGCAACGT ACAACGCCGT TGAGAGCCTT GCGACAAGCA AGTACAGGGC CGGAGTCATA GACGAGGAGT ACTTCAGGGA GGTGATGGAG TCCATCCAGC TGGCCAGGAG AGGCGGAAGG CCGGAAATCC TGGACTCCAA GGAAACTCTC AGAGAAGAGG AACTCTACCC CATGAAGGCG CTCAACCTGT CCTACCTAAC GCCAAAGGTG ATCCTCGAGA TAGCGAAGTA CATGGTTAGA AGCTAG
|
Protein sequence | MKRVDVALIH APSVYDFRER PYVHYGPISD VIPSKPVFDM YPAGFFSLAS YLEERGVKTG IFNLAAKMVN DPRFDVPRFL RSLEASVYGI DLHWLVHAHG ALEIARLVKE LRKGHVVLGG FSATYYWREI LEKFPYVDAI VLGDTTEPVF FEVVQALEAG RLDKLGEVPN LAYRDENGRV RFNGLRYVPV ELDELRPKYD IVVKVMVRSG ITYSIPWSTF LKHPVTAVIT YKGCTFNCLA CGGSRFTYNV IYGRRKLGVK KPETLFEEYK EITERLKAPI FFVNDLQVLG KSYVERLVSL LRSERAGVEV FFEFFTPPPR DFLAVLRSAE ERVYLQISPE THDESIRSTY GRPYTNSSLK AFLRNAEDLG FTRVDLYFMV GLPGQTPENV KGIGSFFEEL RRIAPKVVDA FVAPLAPFVD PGSPAFHMSG KYGYRLFAYT LSDHRKLLLA DKWYLMLNYE TRWMTRAEIA AATYNAVESL ATSKYRAGVI DEEYFREVME SIQLARRGGR PEILDSKETL REEELYPMKA LNLSYLTPKV ILEIAKYMVR S
|
| |