Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3561 |
Symbol | |
ID | 7873067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3903503 |
End bp | 3904498 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643700502 |
Product | Ankyrin |
Protein accession | YP_002890532 |
Protein GI | 237654218 |
COG category | [R] General function prediction only |
COG ID | [COG0666] FOG: Ankyrin repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.466152 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGAAA TCGAATCACC GCTGGCATTC CTGCTGACCA CGCCGTACGC GAGCCGCTAT CCGCGCAAGC TCGTCGAGGG CTTTCCCCAT GTCGCCCGCC GCATCGCGGA GCTCTGGGAC GACAAGGACA AGCTGGGCGA ATACTTCACC GAGCTCATGG TCTCGAAACG GCCGAACCGG CGCGGTTTCC CGCCCGAGGT GGGCGCAGAG ATCGTTTATC TGAGCATGGC CTACGACCTC CACGGCCCGG TGCGTCCCAC CCCGCAAGCG GCCCCCGAGC CCACGACGGC CGCGCGCGAC GACGCCTGGG ACTACGAGCG CGCGGTCGCC GAGCTCGAGC GCCTCGACAT TCCGATCACG ATGGCGCAGT TCGTGCGCGC GCTGGAGGCT GGCGACCAGC ACCTGTGCTC GCTCTTCCTG CATGCCGGCT TCGACATCGA CGGCCGCGAC GCGCGCCAGT GGACGCCGCT GATGATCGCC TGCTTCCATG GCCGCGAGGC GCTCGCCCTC GAGCTGATCC GGCTCGGCGC CAGCGTGGAT GCGACGGACG CCGACGGCTA CACGCCGCTG CACTGGGCCT CGGTGAACGG CTACCAGAAG GTCGGCGAGG TGCTCGTGCG CCGCCAGGCC GAGGTCAATG CCACGAGCAA CGCCGGCATC ACGCCGCTGC TGCAGGCCGC GGCGCGCGGC CACCTGGGCG TCGTGCGCCT GCTGCTCGAC CGCAAGGCCA AGGTCAATCT CGTCGCCGCG GACGGCTCCA CGGCACTGCT CAAGGCGGTC GCCAACGGAC ACTGGGAGAT CGTCAACACG CTGCTCGACG CCGGCGCCTC CACGCAGGCG ACGATGAAGA ACGGCACCAC CCTGGTCGAC ATCGCCGCAC GTTCGAAGGA CGAGCGCATC CGCGAACGCA TCGCCATCGC CGCGCGCATG GAGGCGCGCG GCGATGCGCC GATCCAGCGC GACGAGCCGC CACCGCTCAC CGGCACGATC TACTGA
|
Protein sequence | MAEIESPLAF LLTTPYASRY PRKLVEGFPH VARRIAELWD DKDKLGEYFT ELMVSKRPNR RGFPPEVGAE IVYLSMAYDL HGPVRPTPQA APEPTTAARD DAWDYERAVA ELERLDIPIT MAQFVRALEA GDQHLCSLFL HAGFDIDGRD ARQWTPLMIA CFHGREALAL ELIRLGASVD ATDADGYTPL HWASVNGYQK VGEVLVRRQA EVNATSNAGI TPLLQAAARG HLGVVRLLLD RKAKVNLVAA DGSTALLKAV ANGHWEIVNT LLDAGASTQA TMKNGTTLVD IAARSKDERI RERIAIAARM EARGDAPIQR DEPPPLTGTI Y
|
| |