Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0943 |
Symbol | |
ID | 7085046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1034012 |
End bp | 1035418 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643697965 |
Product | hypothetical protein |
Protein accession | YP_002354605 |
Protein GI | 217969371 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCCT CCCTCCCTGA CTTCCTGCAC CGCGCGAAGT CGCCCCTGCG GGCCACGCAG CTCGTGGCGG CCATCACGCT GGTCGTCGGC GTCGCCTGGG CGCAGACCCG CATCGACCCC AATGGCGTGA ACGTCAGCGG CAGCGTGATC GGCGACGACG TGCTCTACAG CATCGGCGGC GGCCGGGCCG TGTCGATGGG TGGTGCCGGC AACATGCAGA GCATCGGCGT CGGCGTCGGC TGGAACAGCA ACCTGATCTG CGGCGACATG AGTATCACCA CGACACTGCA GAACCAGCTC AACGGCATCA CCAACGGCTT CCAGACGATC ATGAGCAACG TGATCCAGAA CGCCACCAGC GCCGTGGCCT CGCTGCCGGC GCTGATCATC CAGCGCGCCG ACCCGGGGCT CTACAACCTG CTGACCAACG GCATCCTCCA GGCGCGCCTG GACTTCGACC GCTCGAAGAT GACCTGCCGG GCCATCGCCA ATCGCATGGC GGATATGGCC GGCGGCCAGG CCGGCTGGGA CCAGCTCGCC GAGGGGATGG CGCTGCGGGA TGCGGTCAGC AGCACCGATG CCGTCTCCGC CATCGAGCAG GCCGAGTCCA ACAAAGGAAA CAACGGCGTG CCATGGGTCG GCGGCGGCAA CGCGGGCGGC TCCGGTCAGA GTTCGATCAA GGTGGTCGGC GACGTGACGC GCGCGGGCTA CAACCTGCTC AACGGGCGCA GCGCCACCGA TAGCTCGTCG ATCGCACCCA GCGCCTGCGG CAACCGCCTG ACCTGCCAAA CCTGGTCCTC GCCGCAGGCG GCCTCGGCCT GGGCGATCCG CGTGCTGGGC GAGCGTGAGC AGCGCACCTG CGAAAACTGC ACGAAGACCC AGACCACGCC TGGCGTCGGG CTGACGCCAA TGATCCAGGA AGAGTACGAA ACCAAGCTGC AGGTGCTGCA GGAACTGGTG ACGGGCGCCC GGCCGACGAC GCTGGCCAAT CTCGACGCGG CCGGCAGCAG CTCGCTGCCG ATCACCCGCG GCGTGATCGA GGCCTTGCGG GACGAACCCG ACCAGGACCT GCTGGGCAAG CGCCTCGCGT CCGAGGCGGC GCTGTCCAGC GTGCTGGAGA AGGCCCTGCT GCTGCAACGC ACGTTGCTGA CCGGCAAGAA GGAGCCGAAC GTCGCTGCCA ACGAGCTGGC CGTGCAGGCG GTCGACCAGG AGAACAGCGC GCTGGAGCAG GAGATCAACA ACCTCAAGAC CGAACTGGAG CTGCGCCGCA CGCTGGCCGG CAACTCGGCG ATGGCGATCA TCCAGCGCCA CAGCACCCGC GCGGCCGGCT CGCGCGGCGT CTTCGAGGGC GACACCACAC GCGACCGTCT GCGGGAAGTC CAGAAGCCGC GGAGCGGTAC GCCATGA
|
Protein sequence | MNASLPDFLH RAKSPLRATQ LVAAITLVVG VAWAQTRIDP NGVNVSGSVI GDDVLYSIGG GRAVSMGGAG NMQSIGVGVG WNSNLICGDM SITTTLQNQL NGITNGFQTI MSNVIQNATS AVASLPALII QRADPGLYNL LTNGILQARL DFDRSKMTCR AIANRMADMA GGQAGWDQLA EGMALRDAVS STDAVSAIEQ AESNKGNNGV PWVGGGNAGG SGQSSIKVVG DVTRAGYNLL NGRSATDSSS IAPSACGNRL TCQTWSSPQA ASAWAIRVLG EREQRTCENC TKTQTTPGVG LTPMIQEEYE TKLQVLQELV TGARPTTLAN LDAAGSSSLP ITRGVIEALR DEPDQDLLGK RLASEAALSS VLEKALLLQR TLLTGKKEPN VAANELAVQA VDQENSALEQ EINNLKTELE LRRTLAGNSA MAIIQRHSTR AAGSRGVFEG DTTRDRLREV QKPRSGTP
|
| |