Gene Information |
Locus tag | Tmz1t_0492 |
Symbol | |
ID | 7085003 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 554473 |
End bp | 556080 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643697521 |
Product | hypothetical protein |
Protein accession | YP_002354163 |
Protein GI | 217968929 |
COG category | [S] Function unknown |
COG ID | [COG2187] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
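The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming a trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(554473, 556080)
print(length)                  # 1608
print(protein_length(length))  # 535
```

Both values agree with the Gene Length (1608 bp) and Protein Length (535 aa) rows of this record.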
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATGCAT CCGAAACCCT GCTGAACACG CTGCGCCGGC CCGGTGTGCT GCCCGGCTCG GAGGGCGGCG TCGAGCTGAT CGAGACCCAC ATCTCCTGGG TGCTGCTCGC CGGCGAGCAC GCCTGGAAGC TGAAGAAGCC GCTCGACCTC GGCTTCCTGG ACTTTTCCAC GCCGGCGCGC CGGCGCGAGG CCTGCGAGGC GGAGCTGCGG CTCAACCGCC GCACGGTGCC GGAGATCTAC GAGGCGGTGG TGGCGGTGCG CGGCAGCCCC GAGGCGCCCC GCCTCGACGG TGGCGGCGAG CCCTTCGACT GGCTGCTGCG CATGCGCCGC TTCGACCAGG CCGGGCTGTT CTCGCGCCTG CTCGACGAGG GCAGGCTGGC GCCCGCGCTC TTCGATCGCC TCGCCCGCCA TGTCGCCGCG TTCCACGCCG CCGCCGCGGT GGCGTTGCCG GGCGGGGCGT TCGGCGATGC GGCCGCGGTG CATGCGCCGG TGCGGCAGAA CTTCGCGCAG ATGCGCGAGC ACCTCGATCC CGGCGTCCAT GCCGACCTGC TCGCGATGCT GGCCCGGGTG GAGGCCTGGG CCGAGGCGCA GTACGCCGCG CTCGCGCCGG TGTTCGCCGT GCGCCTGGCT GAAGGTCGAG TGCGCGAATG CCATGGCGAC CTCCACCTGG GCAACCTGGT CGTGCTCGAC GGCGAGCCGC GCCTCTTCGA CGCCATCGAG TTCAGCGCCG AGCTGCGCTG GACCGACGTC GTCGCCGACG TCGCCTTCCT GGTCATGGAC CTGCAGGCGC GCGGGCGGGC CGCGCTGGGT TGGCGCTTCC TCAATGCCTG GCTGGAGCGC TGCGGCGACT ACGCCGGCCT GCGCGTGCTC CCCTACTACC TCAGCTATCG CGCGATGGTG CGCGCCAAGA TCGCCGCGAT CCGCGCCAGC CAGCTCGACG GCGCCGAACG CAGCGAGAGC GTGGACGAAT GCCGTCGCTA CCTCGCGCTC GCCGAGGCGC AGAGGGCGCT GGGCGCGCCG GCCTTGCTGG TGGCCTGCGG CGTCGCGGGC GCGGGCAAGA CCAGCCAGTC GCAGGCCCTG GTGGAGGCGG GCGGGGTGAT CCGCGTGCGC GCCGACGTCG AGCGCAAGCG CCTCGCCGGC CTGGCGGCGG AGGCGACGAG CGGCTCCGGC CTCGGCGGCG GGCTATATAC CCCGGAGGCC ACCGCGCGCA CCTATGCGCG CCTGGCCGTG CTGGCGCGCG TGGTGCTCGA GGCCGGGCGC CCGGTGCTGG TCGACGCCAC CTTCCTGAAA CGCGCCCAGC GCGCGGCGTT CGCCGGGCTC GCGCAGGAGC TCGGCGTGCC CTTCGCCATC CTCGCCTTCG ATGCGCCGGA GGACGTCCTG CGTGCGCGCG TGCGCGCCCG CCTGGCCGCC GGCGGCGACG CCTCGGAGGC CGACGAGGCG GTGCTCGAGG CGCAACTGCG CAGCCGCGAG GGCTTCGCTG CCGAGGAACT CGCGCGCGTC CTGCCGATCG ATACCCGGCC AGTGCCGGAT TGGCGCAGCC TGCTGCCGGC GCTGTCGAGG CTGTGGCCGG GGGCTCCGCG CACGGACGGC CGGCTCACGT CCAATTGA
|
Protein sequence | MHASETLLNT LRRPGVLPGS EGGVELIETH ISWVLLAGEH AWKLKKPLDL GFLDFSTPAR RREACEAELR LNRRTVPEIY EAVVAVRGSP EAPRLDGGGE PFDWLLRMRR FDQAGLFSRL LDEGRLAPAL FDRLARHVAA FHAAAAVALP GGAFGDAAAV HAPVRQNFAQ MREHLDPGVH ADLLAMLARV EAWAEAQYAA LAPVFAVRLA EGRVRECHGD LHLGNLVVLD GEPRLFDAIE FSAELRWTDV VADVAFLVMD LQARGRAALG WRFLNAWLER CGDYAGLRVL PYYLSYRAMV RAKIAAIRAS QLDGAERSES VDECRRYLAL AEAQRALGAP ALLVACGVAG AGKTSQSQAL VEAGGVIRVR ADVERKRLAG LAAEATSGSG LGGGLYTPEA TARTYARLAV LARVVLEAGR PVLVDATFLK RAQRAAFAGL AQELGVPFAI LAFDAPEDVL RARVRARLAA GGDASEADEA VLEAQLRSRE GFAAEELARV LPIDTRPVPD WRSLLPALSR LWPGAPRTDG RLTSN
|
| |
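The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before use. The following is a minimal sketch, not the database's own tooling: it strips the spacing, translates codons, and computes a GC fraction. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (only the permitted start codons differ); the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(raw: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def translate(raw: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    seq = clean(raw)
    usable = len(seq) - len(seq) % 3
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))
    return protein.split("*")[0]


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(raw)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three and last two blocks of the gene sequence in this record.
print(translate("ATGCATGCAT CCGAAACCCT GCTGAACACG"))  # MHASETLLNT
print(translate("CGGCTCACGT CCAATTGA"))               # RLTSN
```

The two printed fragments match the first and last residues of the protein sequence listed in the record. Passing the full gene sequence to `gc_fraction` gives a value to compare against the reported GC content.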