Gene Tfu_3078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTfu_3078 
Symbol 
ID3580078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermobifida fusca YX 
KingdomBacteria 
Replicon accessionNC_007333 
Strand
Start bp3600559 
End bp3601851 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content72% 
IMG OID637686815 
Productdyp-type peroxidase 
Protein accessionYP_291134 
Protein GI72163477 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01412] Tat-translocated enzyme
[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGAAC CAGACACGGA GCGGAAAGGC TCCTCCCGCC GAGGATTCCT CGCGGGACTG 
GGCGCGGCAG CACTCACCGG CGCAGGCATC GGCATGGCGG CAGGAGAAGT CCTCCGCCCC
CTGCTGCCCG ACTCCGACCC GGCCGCCTCC CCGGAAGCCG AGCAGCGGCT GCGCATGGCA
GCCCAACGGG CCGACGCCAC CGCAGCGCCC CAACCCGGCA TCTCCGGCCC AGCACCCGCG
TTCGTTCACG TCATCGCGCT CGACCTGGCG GAAGAAGCCC GTAAGAACCC CGACACCGCC
CGCGACAGCG CAGCCGCCGC GCTCCGGTCC TGGACCGAAC TAGCGGCCCG CCTGCACGAG
GAGAGTCCGC ACGACATCGC CGAGGGGGCC GCCTCTGCAG GGCTGCTCCC CGCCTCCCTC
ATGGTCACCG TCGGCATCGG AGGCTCCCTG CTCTCCGCGA TCGACGCGGA AGACCGCCGA
CCGGACGCGC TCGCCGACCT CCCCGAGTTC TCCACCGACG ACCTGCACCC CCGCTGGTGC
GGTGGAGACT TCATGCTCCA AGTCGGTGCG GAAGACCCCA TGGTGCTCAC CGCGGCCGTG
GAAGAACTCG TCGCCGCGGC CGCGGATGCG ACCGCGGTGC GCTGGTCTCT GCGCGGCTTC
CGGCGGACCG CCGCAGCCGC GCGCGACCCC GACGCCACCC CCCGCAACCT CATGGGGCAG
ATCGACGGCA CCGCCAACCC CGCCCAGGAC CACCCGCTGT TCGACCGGAC CATCACCGCA
CGGCCGGCCG ACAACCCCGC GCACGCCTGG ATGGACGGCG GCAGCTACCT GGTCGTGCGA
CGGATCCGCA TGCTCTTGAC CGAATGGCGG AAACTGGACG TGGCCGCCCG GGAGCGGGTG
ATCGGCCGCC GCCTCGACAC GGGAGCACCC CTCGGCAGCC GCAACGAGAC CGACCCCGTC
GTGCTCTCGG CCCGCGACGA GGAAGGGGAA CCCCTCATCC CCGAGAACGC ACACGTGCGC
CTCGCCAGCC CGGAGAACAA CCTGGGTGCC CGCATGTTCC GCCGCGGCTA CAGCTACGAC
CAGGGGTGGC GCGACGACGG CGTCCGCGAC GCCGGACTGC TCTTCATGGC CTGGCAAGGC
GACCCCGCCA CCGGGTTCAT CCCGGTGCAG CGCAGCCTCG CCGACCAGGG CGACGCCCTC
AACCGCTACA TCCGGCACGA AGGCAGCGCC CTCTTCGCCG TCCCCGCCGC CCGGGAAGGC
CGCTACCTGG GACAGGACCT GATCGAAGGA TGA
 
Protein sequence
MTEPDTERKG SSRRGFLAGL GAAALTGAGI GMAAGEVLRP LLPDSDPAAS PEAEQRLRMA 
AQRADATAAP QPGISGPAPA FVHVIALDLA EEARKNPDTA RDSAAAALRS WTELAARLHE
ESPHDIAEGA ASAGLLPASL MVTVGIGGSL LSAIDAEDRR PDALADLPEF STDDLHPRWC
GGDFMLQVGA EDPMVLTAAV EELVAAAADA TAVRWSLRGF RRTAAAARDP DATPRNLMGQ
IDGTANPAQD HPLFDRTITA RPADNPAHAW MDGGSYLVVR RIRMLLTEWR KLDVAARERV
IGRRLDTGAP LGSRNETDPV VLSARDEEGE PLIPENAHVR LASPENNLGA RMFRRGYSYD
QGWRDDGVRD AGLLFMAWQG DPATGFIPVQ RSLADQGDAL NRYIRHEGSA LFAVPAAREG
RYLGQDLIEG