Gene Tmz1t_1969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1969 
Symbol 
ID7084437 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp2219477 
End bp2221168 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content71% 
IMG OID643698994 
Productsingle-stranded-DNA-specific exonuclease RecJ 
Protein accessionYP_002355616 
Protein GI217970382 
COG category[L] Replication, recombination and repair 
COG ID[COG0608] Single-stranded DNA-specific exonuclease 
TIGRFAM ID[TIGR00644] single-stranded-DNA-specific exonuclease RecJ 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00305659 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCCGAA TCCAGCCCCG CTCCATCCCG CAGCAAGCCT TCGCCCGCCT CGCCGACGCC 
GGCCTGCATC CGCTGCTCGC CCGCCTCTAC GCCGCGCGCG GCATCGCGCG CGCCGACGAG
CTCGACACCA GCCTGAAGAA CCTGCTGCCG CCCGAGGCGC TCACCGGCAC GGCGGAAGCC
GCGATCCTGC TCGCCGACGC CATCGAGGCC GGCGCGCGCA TGGTCATCGT CGCCGACTAC
GACTGCGACG GCGCCACCGC CTGCGCGGTG GGCGTGCGCG CGCTGCGCGC CTTCGGTGCC
GACGTGCATT ACCTCGTGCC GGACCGCGTC ACGCTCGGCT ACGGCCTCAC CCCGGCCATG
GTCGAGATCG CCGCGCGCCT CGAGCCCGAC GTGCTGATCA CCGTCGACAA TGGCATCGCC
AGCGTCGAGG GCATCGCCGC CGCGCGCGCC CATGGCATGG CCACGGTGAT CACCGACCAT
CACCTGCCCG GCGACGTGCT GCCCGAAGCC GACGTGATCG TGAACCCCAA CCAGCCCGGC
TGCGACTTCC CGAGCAAGGC GCTGGCCGGC GTGGGCGCGA TGTTCTACAC GATGCTCGCG
CTGCGCGCCG AACTGCGCGA GCGCGGCGCC TTCGCCGGCG CCAAGGAACC CAACCTTGCC
GAGCTGCTCG ACCTCGTCGC GCTCGGCACC GTGGCCGACG TGGTCAAGCT CGACCGCAAC
AACCGCATCC TGGTCGCGCA AGGCCTCGCG CGCATGCGCG CCGGGCGCCT GCAGCCCGGC
ATCCGCGCGC TGTTCGGGCT CGCCGGGCGC GACCCGGCGC GCGCCAGCAC GATGGACCTC
GGCTTCATGA TCGGCCCTCG CCTCAACGCG GCCGGGCGAC TCTCCGACAT GAGCCTGGGC
ATCGAGTGCC TGATCACCGA CGACCCCGGC CGGGCGATGA ACATCGCCCA GGAGCTCGAC
AAGCTCAACC GCGAACGCCG CAGCATCGAG GCCGGCATGC AGGAAGAAGC GCTCGCCCGC
CTGGCGGGTT TCGACGCCGG CAACCGCGCC ACCGTGGCGC TCTTCGAACC CGACTGGCAC
CAAGGTGTGA TCGGCATCGT CGCCGGCCGC ATCAAGGAGC GGCTGCACCG CCCCACCATC
GCCTTCGCGC GCGCGAGCGA CGGCGAACTC AAGGGCTCCG GGCGCTCGAT CCCCGGCCTG
CACCTGCGCG ACGCGCTCGA CCTCGTCACC AAGCGCCAGC CCGACCTCAT CGTGCGCTTC
GGCGGCCACG CCATGGCGGC CGGCCTGACC ATCCGCGAAT CGGAACTCGC ACGCTTCGAC
GCGGCCTTCG AGGAGGTCGT CGGCGAACTG CTCGAGCCCG CCCAGCTCGA GCGCCGCATC
GATACCGACG GCAGCCTGGA ATCGGGCTAC TTCGCGCTCG ACGCCGCGCG CATGCTCGAC
AACGAGATCT GGGGCCAGGG CTTCCCGGCG CCGCTGTTCG ACGACGTCTT CCGCGTCGAG
CGCCAGCGCC TGTTGAAGGA CAAGCACCTC AAGCTCGAGC TCGCGCGCGG CAGCACGCGC
TACGAGGCCA TCCGCTTCAA CCACGCCGAG GGCGCCGCCG GCCAGATCCA CGCCGCCTTC
CGCCTCGGCA TCAATGAATA CAACGGCGTC GCCAGCGTGC AGCTGATGCT CGAACATTTC
GAGGCGGCGT AG
 
Protein sequence
MTRIQPRSIP QQAFARLADA GLHPLLARLY AARGIARADE LDTSLKNLLP PEALTGTAEA 
AILLADAIEA GARMVIVADY DCDGATACAV GVRALRAFGA DVHYLVPDRV TLGYGLTPAM
VEIAARLEPD VLITVDNGIA SVEGIAAARA HGMATVITDH HLPGDVLPEA DVIVNPNQPG
CDFPSKALAG VGAMFYTMLA LRAELRERGA FAGAKEPNLA ELLDLVALGT VADVVKLDRN
NRILVAQGLA RMRAGRLQPG IRALFGLAGR DPARASTMDL GFMIGPRLNA AGRLSDMSLG
IECLITDDPG RAMNIAQELD KLNRERRSIE AGMQEEALAR LAGFDAGNRA TVALFEPDWH
QGVIGIVAGR IKERLHRPTI AFARASDGEL KGSGRSIPGL HLRDALDLVT KRQPDLIVRF
GGHAMAAGLT IRESELARFD AAFEEVVGEL LEPAQLERRI DTDGSLESGY FALDAARMLD
NEIWGQGFPA PLFDDVFRVE RQRLLKDKHL KLELARGSTR YEAIRFNHAE GAAGQIHAAF
RLGINEYNGV ASVQLMLEHF EAA