Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2243 |
Symbol | |
ID | 7083675 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2526470 |
End bp | 2527510 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643699262 |
Product | selenophosphate synthetase |
Protein accession | YP_002355878 |
Protein GI | 217970644 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0709] Selenophosphate synthase |
TIGRFAM ID | [TIGR00476] selenium donor protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.466152 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGAAA TCAGACTGAC CGAATTCTCC CACGGCGGCG GCTGCGGCTG CAAAATCGCC CCCGCCGTGC TCGCCGAGCT GCTCGCCACC ATGCCCGCCG GCCTCGTGCC ACCCGAGCTG CTGGTGGGGA CCGACACTGC CGACGACGCC GCCGTCTACA AGCTCAACGA TCGCCAGGCC ATCGTCGCCA CCACCGACTT CTTCACCCCG ATCGTGGACA ACCCGCGCGA CTTCGGCCGC ATCGCCGCCA CCAACGCGCT CTCCGACGTC TATGCGATGG GCGCCCAGCC CATCCTGGCG CTGGCGATCG TCGGCATGCC GCTCGACAAG CTGCCGCCCG CGGTGATCGG CGAGATCCTC AAGGGCGGCG CCGAGGTCTG CCGCGACGCG GGCATCCCGG TCGCCGGCGG GCACTCGATC GACGTGCTCG AGCCGATCTA CGGTCTGGTC GGCCTCGGCG TGGTCGACCC CGCGCGCGTG CGCACCAACG CCGGCGCCAA GGCGGGCGAC GTGCTCATCC TCACCAAGCC GCTCGGCATC GGCATCCTCT CGGCGGGCTT GAAGAAGGGC CGGCTGTCGG CAGACGGTTA CGCGCAGATG ATCCGCTGGA CGACGACGCT CAACCGTGTC GGCGCCAGGC TCGCCGACCT CGACGGCGTG CACGCGGTGA CCGACGTCAC CGGCTTCGGC CTCGCGGGGC ACCTGCTGGA GATGTGCCGC GGCGCGTCGC TGACCGGCGC GGTGCGCTTC GATGCGCTGC CGGTGATCGA GGAGGCGCGC GCCCTGGTCC GGGACGGCGT CGCCACCGGC GCGTCGACGC GCAACTGGGC GAGCTACGGT GCCAGCGTGG AGCTGCCGGT GGACGCCCCG GAGTGGCAGC GCAAGCTCGT CACCGACCCG CAGACCTCCG GCGGCCTGCT GATCGCCTGC GCGCCGGAAG CCGTCGAGGC CGTGCAGGCC GCGGTGCGCG CCGAGCAGGG CGAGGCGGGG ACGATCGTCG GCGAGATGAA GGCGGGCGCA GCGCGGGTGG TGGTGGGGTA G
|
Protein sequence | MNEIRLTEFS HGGGCGCKIA PAVLAELLAT MPAGLVPPEL LVGTDTADDA AVYKLNDRQA IVATTDFFTP IVDNPRDFGR IAATNALSDV YAMGAQPILA LAIVGMPLDK LPPAVIGEIL KGGAEVCRDA GIPVAGGHSI DVLEPIYGLV GLGVVDPARV RTNAGAKAGD VLILTKPLGI GILSAGLKKG RLSADGYAQM IRWTTTLNRV GARLADLDGV HAVTDVTGFG LAGHLLEMCR GASLTGAVRF DALPVIEEAR ALVRDGVATG ASTRNWASYG ASVELPVDAP EWQRKLVTDP QTSGGLLIAC APEAVEAVQA AVRAEQGEAG TIVGEMKAGA ARVVVG
|
| |