Gene Namu_3098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3098 
Symbol 
ID8448712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3419124 
End bp3420125 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content77% 
IMG OID645042179 
Productthioredoxin 
Protein accessionYP_003202420 
Protein GI258653264 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG3118] Thioredoxin domain-containing protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000032408 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000022841 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCCGCC CCACCCCCCG ACCGCAGCCG CCGGCGGCGA TGAGCGCCGC GTTCGCCGGC 
GCGGTCGACC TGTCCGCCCT CAAGAGCCGG GCGGCCGGCC CGGCCGGACC CGGCGCCGCC
GGACCCGCGG CTCCCGGGGC ACCTGCCGGT GCCCCCGCGG CCGGGACGGG GGAGCCCTCG
CCCTACATCG TCGACGTCGA CGAGCCCACG TTCGGCAGCC TGGTCCAGGC CTCCACCCAG
CTGCCGATCA TCCTGAACTT CGAGGCCGCC TGGGCCGAAC CGAGCCTGGC GCTCTCGGCC
GCCCTGAGCA AGCTGGCGGC GGCCGGCGGC GGGGCCTGGA TCCTGGGTCG GGTCGATGTC
GACGCCAACC CGCGGATCGC CCAGGCCCTG CAGGTGCAGA CGCTGCCGTT GGCGGTCGTC
CTGGTGCAGG GTCAGCCGGC CGCCGAGGTG CCCGGGGTCG CGTCCGAACC GCAGCTGCGG
CAGTGGATCG CCTCGCTGCT CGATCAGCTG CGCGAGCACC TGCCGGGCAT CGCCCAGGCC
GAGGCCCGGC TGGCCGCCGA GGGTGGCGGC GCGCAGGAGG AGCCGGAGCC CGAGGATCCG
CGGTTCGTCG CCGCGGAGGA GGCGCTGGCC GAAGGGGACT ACGCGGCCGC CGAACTGGCC
TACCAGCAGA TCCTGGCGGT GGAGCCGGCC AACGCCGAGG CCAAGGCGGC GCTGGCCCAG
GTCGGTCTGC TCGCCCGGGT GGACAGCCTG CCGCCGGATG CGATCGCCGC CGCGGATGCC
GCGCCCGACG ACGTCGAGCT GCAGAAGGGC GCCGCCGACG CCGAGTTGGC CGCGGGGCAG
GCCGGGGCGG CCTTCGCTCG ACTGATCGCC ACGGTTCGCC GGACCGCCGG GGACGAGCGG
ACCGCCGCCC GCGAGCACCT GGTCGAGCTG TTCGGCCTGT TCGCGCCGGA CGATCCCGAA
GTGATCAAGG CCCGGCGCGC ACTGGCCGCC GCCCTGTACT GA
 
Protein sequence
MTRPTPRPQP PAAMSAAFAG AVDLSALKSR AAGPAGPGAA GPAAPGAPAG APAAGTGEPS 
PYIVDVDEPT FGSLVQASTQ LPIILNFEAA WAEPSLALSA ALSKLAAAGG GAWILGRVDV
DANPRIAQAL QVQTLPLAVV LVQGQPAAEV PGVASEPQLR QWIASLLDQL REHLPGIAQA
EARLAAEGGG AQEEPEPEDP RFVAAEEALA EGDYAAAELA YQQILAVEPA NAEAKAALAQ
VGLLARVDSL PPDAIAAADA APDDVELQKG AADAELAAGQ AGAAFARLIA TVRRTAGDER
TAAREHLVEL FGLFAPDDPE VIKARRALAA ALY