Gene Tpen_0071 details


Gene Information

Locus tag: Tpen_0071
Symbol:
ID: 4601779
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermofilum pendens Hrk 5
Kingdom: Archaea
Replicon accession: NC_008698
Strand:
Start bp: 54750
End bp: 56405
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 56%
IMG OID: 639772825
Product: radical SAM domain-containing protein
Protein accession: YP_919484
Protein GI: 119718989
COG category: [C] Energy production and conversion
COG ID: [COG1032] Fe-S oxidoreductase
TIGRFAM ID:
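
The coordinates and lengths reported above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; both are assumptions that happen to be consistent with the reported values, not statements taken from the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 54750
END_BP = 56405
GENE_LENGTH_BP = 1656
PROTEIN_LENGTH_AA = 551


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = coding_protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)        # coordinate span vs. Gene Length
print(computed_protein_length == PROTEIN_LENGTH_AA)  # codon count vs. Protein Length
```

Under these assumptions, 56405 - 54750 + 1 gives 1656 bp, and 1656 / 3 - 1 gives 551 aa, matching the Gene Length and Protein Length fields.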


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCGCG TAGACGTTGC CTTGATTCAC GCTCCAAGCG TGTACGACTT CCGCGAGCGC 
CCCTACGTCC ACTACGGACC CATAAGCGAC GTGATACCCT CTAAGCCTGT CTTCGACATG
TACCCCGCCG GCTTCTTCTC CCTGGCAAGC TACCTGGAGG AAAGGGGGGT TAAAACCGGG
ATATTCAACT TAGCGGCTAA AATGGTGAAC GACCCCCGCT TCGACGTTCC ACGCTTCCTC
AGATCCCTCG AGGCAAGCGT GTACGGGATA GACCTGCACT GGCTGGTGCA CGCCCACGGA
GCCCTCGAGA TTGCGAGGCT AGTCAAGGAG CTGAGGAAGG GGCACGTCGT GCTGGGAGGC
TTCTCAGCGA CCTATTACTG GAGGGAAATC CTGGAGAAGT TCCCGTATGT CGACGCAATA
GTCCTCGGGG ACACCACGGA GCCAGTCTTC TTCGAAGTAG TGCAAGCGCT GGAGGCGGGG
CGCCTCGATA AGCTCGGGGA GGTGCCTAAC TTAGCCTACA GAGACGAGAA CGGCAGAGTG
AGGTTCAACG GTCTAAGGTA CGTGCCGGTA GAGCTTGACG AGCTCAGACC AAAGTACGAC
ATCGTCGTAA AGGTGATGGT GAGGAGCGGT ATAACGTACT CCATACCTTG GAGCACTTTC
CTCAAGCACC CTGTAACCGC GGTTATAACG TACAAGGGTT GCACGTTCAA CTGCCTAGCC
TGCGGAGGAA GCAGGTTCAC GTACAACGTG ATCTACGGGA GGAGGAAGCT GGGCGTCAAG
AAGCCGGAAA CGCTTTTCGA AGAGTACAAG GAGATAACCG AGAGGCTGAA GGCTCCTATA
TTCTTCGTCA ACGACCTCCA AGTATTAGGG AAAAGCTACG TAGAGCGACT AGTAAGCCTC
CTGAGAAGTG AGAGGGCAGG CGTAGAGGTA TTCTTCGAGT TCTTCACGCC GCCTCCAAGG
GACTTCCTCG CAGTACTGAG AAGCGCCGAG GAAAGGGTTT ACCTCCAGAT CTCGCCCGAG
ACGCACGACG AGAGTATCCG GTCAACGTAC GGGAGGCCCT ACACGAACAG CTCGCTGAAG
GCTTTCCTAA GAAACGCGGA GGATCTAGGC TTCACGAGGG TAGACCTCTA CTTCATGGTA
GGGCTACCGG GGCAAACCCC TGAAAACGTT AAGGGTATAG GTAGCTTCTT CGAAGAACTC
AGACGCATTG CCCCAAAAGT CGTAGACGCC TTCGTAGCGC CACTAGCGCC CTTCGTTGAC
CCGGGAAGCC CGGCCTTCCA CATGTCCGGC AAGTACGGGT ACCGCTTATT CGCGTATACA
CTCTCGGACC ACAGGAAGCT CCTACTCGCG GATAAGTGGT ACCTAATGCT CAACTACGAG
ACTAGGTGGA TGACGCGGGC AGAGATAGCG GCGGCAACGT ACAACGCCGT TGAGAGCCTT
GCGACAAGCA AGTACAGGGC CGGAGTCATA GACGAGGAGT ACTTCAGGGA GGTGATGGAG
TCCATCCAGC TGGCCAGGAG AGGCGGAAGG CCGGAAATCC TGGACTCCAA GGAAACTCTC
AGAGAAGAGG AACTCTACCC CATGAAGGCG CTCAACCTGT CCTACCTAAC GCCAAAGGTG
ATCCTCGAGA TAGCGAAGTA CATGGTTAGA AGCTAG
 
Protein sequence
MKRVDVALIH APSVYDFRER PYVHYGPISD VIPSKPVFDM YPAGFFSLAS YLEERGVKTG 
IFNLAAKMVN DPRFDVPRFL RSLEASVYGI DLHWLVHAHG ALEIARLVKE LRKGHVVLGG
FSATYYWREI LEKFPYVDAI VLGDTTEPVF FEVVQALEAG RLDKLGEVPN LAYRDENGRV
RFNGLRYVPV ELDELRPKYD IVVKVMVRSG ITYSIPWSTF LKHPVTAVIT YKGCTFNCLA
CGGSRFTYNV IYGRRKLGVK KPETLFEEYK EITERLKAPI FFVNDLQVLG KSYVERLVSL
LRSERAGVEV FFEFFTPPPR DFLAVLRSAE ERVYLQISPE THDESIRSTY GRPYTNSSLK
AFLRNAEDLG FTRVDLYFMV GLPGQTPENV KGIGSFFEEL RRIAPKVVDA FVAPLAPFVD
PGSPAFHMSG KYGYRLFAYT LSDHRKLLLA DKWYLMLNYE TRWMTRAEIA AATYNAVESL
ATSKYRAGVI DEEYFREVME SIQLARRGGR PEILDSKETL REEELYPMKA LNLSYLTPKV
ILEIAKYMVR S
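
Both sequences above are displayed in space-separated groups of ten letters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup with a simple GC-fraction helper; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the gene sequence is embedded here as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace (group spaces, line breaks) and upper-case the result."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence as displayed in the record.
FIRST_ROW = "ATGAAGCGCG TAGACGTTGC CTTGATTCAC GCTCCAAGCG TGTACGACTT CCGCGAGCGC "

first_row_clean = clean_sequence(FIRST_ROW)
print(len(first_row_clean))          # number of bases in the sample row
print(gc_fraction(first_row_clean))  # GC fraction of the sample row only
```

Applied to the whole gene sequence block, the cleaned length can be compared with the Gene Length field (1656 bp) and the GC fraction with the GC content field (56%); applied to the protein block, the cleaned length can be compared with the Protein Length field (551 aa).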