Gene Tmz1t_1227 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1227 
Symbol 
ID7083887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1361288 
End bp1363036 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content69% 
IMG OID643698243 
Productprolyl-tRNA synthetase 
Protein accessionYP_002354882 
Protein GI217969648 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00409] prolyl-tRNA synthetase, family II 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.463936 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGCCA GCCAGTTCCA TCTCTTCACC CTCAAGGAAG CCCCCTCCGA CGCCGAGGTC 
GTCAGCCAGA AGCTGATGCT GCGCGCCGGC ATGATCCGCA AGGTCGCCGC CGGCATCTAC
AGCTACATGC CGATGGGCCT GCGCGCGATC CGCAAGGTCG AGGCCATCGT GCGCGAGGAG
ATGGACCGCG CCGGCGCCAT GGAACTGATC ATGCCGATGG TGCAGCCCGC CGAGCTGTGG
GACGAGACCG GGCGCTGGGA CAAGATGGGC GACGAGCTGC TGCGCTTCAA GGACCGCCAC
GAGCGCGACT TCGCGCTGCA ACCCACCTCC GAGGAGGTCG TCACCGACAT CGCGCGCCAG
GAGCTCAAGA GCTACCGCCA GCTGCCGAAG AACTTCTACC AGATCCAGAC CAAGTTCCGC
GACGAACGCC GGCCGCGCTT CGGCGTCATG CGCGGGCGCG AATTCACCAT GAAGGACGCC
TACTCCTTCG ACCGCAGCGA GGAAGCCGCC GGGCGCAGCT ACGACATCAT GTTCGCCGCC
TACAAGCGCA TCTTCGACCG CCTCGGCCTC GAATACCGCG CGGTGGCGGC CGACACCGGC
GCCATCGGCG GCGACCGCTC GCACGAGTTC CAGGTCATCG CCGACACCGG CGAGGACGCC
ATCGTCTACT GCCCCGACTC CGACTACGCT GCCAACATCG AGCTCGCCGA GGCGGTGTCC
CTGCTCGCCC GCCGCGCAGA GCCCGCCCGG GCGCTGGCAA AAACCCCCAC ACCGGGCAAG
GCGACCTGCG AGGACGTCGC CGCGCTGCTG GGCGTGCCGC TGGCGACCAC GGTGAAGTCG
CTGGTACTCG CCACCGACGA TGTCGATGAA AAGGGCAAGC CCGCCGGCGT CACCGTGTGG
CTGCTGCTGG TGCGCGGCGA CCACGCCCTC AACGAGGTCA AGGCCGGCAA GCTGGAAGGC
CTCAAGGCAG GCTTCCGCTT CGCCACCGAG GCCGAGATCC TCGAACACTT CGGCTGCAAG
CCGGGCTACC TCGGCCCCAT CGGTCTGAAG AAGCCGGTCA AGGTCATCGC CGACCGCACG
GTCGCCCACA TGGCCGACTT CATCTGCGGC GCCAACGAGG CCGACTTCCA CTTCACCGGC
GTGAACTGGG GCCGCGACCT GCCCGAGCCC GACCAGGTCG CCGACCTGCG CAACGTCGTC
GAGGGCGACC CCAGCCCGGA CGGCCGCGGC GTGCTCGCCA TCCAGCGCGG CATCGAGGTC
GGTCACGTGT TCTACCTCGG CACCAAGTAC TCCAAGGCGA TGAACGCCAC CTTCCTCGAC
GAGGACGGCA AGCCCAAGCA CTTCGAAATG GGCTGCTACG GCATCGGCGT GACCCGCATC
CTGGGCGCCG CGATCGAGCA GAACCACGAC GCCCGCGGCA TCGTCTGGCC GCGCGCGATC
GCACCCTTCG AGGTGGTGGT CTGCCCGGTC GGCTGGGGCA AGTCGCAGGC GGTGCGCGAC
GAGGCGCAGA AGCTCTACGA CGCGCTCGTC GCCACCGGCG TCGACGCCAT CCTCGACGAC
CGCGACGAGC GCCCGGGCGT GATGTTCGCC GACTGGGAAC TGATCGGCGT GCCGCACCGC
GTCACCATCG GCGACCGCGG GCTCAAGGAA GGCGTGATCG AATACCAGGG CCGGCGCGAT
GCCGAAGCCG CCAAGCTGCC CGCAGGCGAG GTCCTCGGCC ACGTGCTCGA GCGCCTGAAC
CAGGGCTGA
 
Protein sequence
MRASQFHLFT LKEAPSDAEV VSQKLMLRAG MIRKVAAGIY SYMPMGLRAI RKVEAIVREE 
MDRAGAMELI MPMVQPAELW DETGRWDKMG DELLRFKDRH ERDFALQPTS EEVVTDIARQ
ELKSYRQLPK NFYQIQTKFR DERRPRFGVM RGREFTMKDA YSFDRSEEAA GRSYDIMFAA
YKRIFDRLGL EYRAVAADTG AIGGDRSHEF QVIADTGEDA IVYCPDSDYA ANIELAEAVS
LLARRAEPAR ALAKTPTPGK ATCEDVAALL GVPLATTVKS LVLATDDVDE KGKPAGVTVW
LLLVRGDHAL NEVKAGKLEG LKAGFRFATE AEILEHFGCK PGYLGPIGLK KPVKVIADRT
VAHMADFICG ANEADFHFTG VNWGRDLPEP DQVADLRNVV EGDPSPDGRG VLAIQRGIEV
GHVFYLGTKY SKAMNATFLD EDGKPKHFEM GCYGIGVTRI LGAAIEQNHD ARGIVWPRAI
APFEVVVCPV GWGKSQAVRD EAQKLYDALV ATGVDAILDD RDERPGVMFA DWELIGVPHR
VTIGDRGLKE GVIEYQGRRD AEAAKLPAGE VLGHVLERLN QG