Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1418 |
Symbol | |
ID | 7083500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 1580355 |
End bp | 1581581 |
Gene Length | 1227 bp |
Protein Length | 408 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643698435 |
Product | hypothetical protein |
Protein accession | YP_002355073 |
Protein GI | 217969839 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0436845 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCGCTTCC TGTTAACTTT TTTCTGCGCC GTGATGGCGC TCGCCGCGGC GCCCCTCCAC GCCCAAAGCC AACCTCCCCC CCTCGCCGCC TCCATGGACG CACGATGGAA GGGCGCCGCC TTCGAGGGCG ACCGCGCCAG CCTCGCCCGC CTCGCCGCCG CCGGCGCCAA GGTCGTGCGC GTGTACCGCC AGTCCGACGC CTGGGTGCTC GACGAGGCGC ACCGCCTCGG CCTGAAGGTG GTGATGGGGC TGTGGCTGGA GCACCCGCGC CACGGTTTCG ACTACGCCGA CGCGCGCGCC ATGCGCGCAC AGGAAGACGC CCTGCTCGAC TTCGTCGCCC GCCACCGCAA GCACCCTGCC CTGCTCGCCT GGGGCGTGGG CAACGAGATC GAGACCGGGG TCGCCGACCC GCTGCCGCTG TGGCGCGCGG TCGACCGGCT GGCCGCGCGC ATCCGCGCGC TCGACCCCGA CCACCCCACC ATGATGGTGG TCGCCGACAC CGGCATGGAC GCCTTCCGCG CACTCGCCGG CTGCTGCCCC AACGTCGAGC TGCTCGGCAT CAACGTCTAC GCCGGTGCGG TGTTCGACCT GCCGCAGCGC CTGCGCGCGG CCGGCATCGC CAAGCCGGTG GTGGTCGCCG AGCTCGGCCC GCTCGGGCAG TGGCAGGCCG GGCGCAAGCC CTGGGGCGCG CCGGTCGAGC TCACCAGCAC CGAGAAGGCG CGCTTCTTCA CCGAGGCCCT CGCCTTCCTC GACCAGCAGG CGCAGATCCG CGGCGTCTTC CCCTTCCTGT GGGGCGCGAA GCAGGAACAG ACCGCGACCT GGCACGGACT GCTGCTCGCC GACGGTAGCC CCACCGCGAT GAGCGACGCT CTCGCCGCCG CCTGGGGGCG GCCGCAGCCT CGGCCCGCAC CGCGCATCCG CGGCATCGGC ATCGGCGCGG ACGAGTTCGC CGCCGGCGCG GAGATCTCCG CCGGCATCGA CGCGGTCGCC CACGACGGCA GCGCGCTCGC CGCCGAGTGG GCCGTGCACG CCGAGGCCAC CGACCTGCGC AAGGGCGGCG ATGCCGAGAC CCCGCCCACG CGCATCGACG TGCGCGTGCT GCACGCCGAT GCCGCCAGCG TGAGCTTCGT CGCCCCGGCG CAGCCCGGCG CCTACCGCCT CTTCATCACC GTGCGCGACC GCGAGGGCAA GGCAGGGACG GCGAACCTGC CGTTCCGGGT GAGGTAA
|
Protein sequence | MRFLLTFFCA VMALAAAPLH AQSQPPPLAA SMDARWKGAA FEGDRASLAR LAAAGAKVVR VYRQSDAWVL DEAHRLGLKV VMGLWLEHPR HGFDYADARA MRAQEDALLD FVARHRKHPA LLAWGVGNEI ETGVADPLPL WRAVDRLAAR IRALDPDHPT MMVVADTGMD AFRALAGCCP NVELLGINVY AGAVFDLPQR LRAAGIAKPV VVAELGPLGQ WQAGRKPWGA PVELTSTEKA RFFTEALAFL DQQAQIRGVF PFLWGAKQEQ TATWHGLLLA DGSPTAMSDA LAAAWGRPQP RPAPRIRGIG IGADEFAAGA EISAGIDAVA HDGSALAAEW AVHAEATDLR KGGDAETPPT RIDVRVLHAD AASVSFVAPA QPGAYRLFIT VRDREGKAGT ANLPFRVR
|
| |