Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3578 |
Symbol | |
ID | 7873083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 3922171 |
End bp | 3923469 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643700518 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002890548 |
Protein GI | 237654234 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0372089 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGGCC AGTTCCGCCT GCTGCGGGAG CGCCGCTTCC TGCCCTTCTT CCTGACCCAG TTCCTCGGCG CCTTCAACGA CAACGTCTAC AAGAACGCGC TGGTGGTGCT GATCACCTTC CAGGCCGCGC GTCTGGGGGC GGACTCGCCC GGGGTGCTGG TGAACCTCGC CGCGGGTGTC TTCATCCTGC CCTTCTTCCT GTTCTCGGCC ACCGCCGGCC AGCTCGCCGA CAAGTACGAG AAGAGCCGCC TGATCCGCGC CACCAAGCTG CTCGAGATCG CCGTGATGGC GCTTGCGGTG GTCGGCTTCG CCCTGATGTC GCTGCCGCTG CTGCTCGTCG TGCTGTTCCT GATGGGGGCG CAGTCGGCAC TGTTCGGGCC GGTCAAGTAC GCGATCATCC CGCAGCAGCT CGCCGACGAC GAGCTGGTGG GCGGCAACGC GCTGGTGGAG GCGGCGACCT TCGTCGCCAT CCTCGCCGGC ACCATCGTCG GCGGCCTGCT GGTGGCGGGG GATGCCGGGC CCGGCCGTGT GGCGGTCGCG GTGCTCGCGA TCGCGCTGCT CGGCTGGTGG TCCAGCCGCG CCATCCCGCC CGCGGCGGCG GCCGACCCCG GGCTGCGGGT GAACTGGAAC CCGGTGACGC AGACGCGCGA GATGCTGCGC TTCATGCTCG AGGCGCGCGC GGTCTTCGTC GCCATCGTCG GCATCTCGTG GTTCTGGTTC TACGGTGCCG TGTTCCTGTC GCAGTTTCCC GGCTTCGCCG CGGATCATCT CCGCGGCGAC GAGCGCGCGG TGACGCTGCT GCTGGCGCTG TTCTCGGTCG GCATCGGCGC GGGCTCGCTG TTGTGCGGGC GCCTGTCGCG CGGGCGGGTG GAGCCGCGCA TGGTGCTGCC GGGGGCGATC GGGCTGAGCC TGTTCGCGCT CGACCTGTGG TGGGCGAGCC CGGCGGCGGG CGCCTTCCCG CCCGGCCAGG GGCTGGATAC GCTGTTTGCC CGCGCCGAGG TGTGGCGGGT GGTGTTCGAC CTCGTGATGA TCGGGGTGTG CGGCGGCTTC TTCATCGTGC CGCTGTATGC GCTCGTGCAG CAGCGCTCCG CGCCGGCGCA CCGCGCGCGC GTGATCGCCG GCAACAACAT CCTCAACGCG CTCTTCATGG TTGCCGCCGC GGCGATGGGC ATCGGGCTGC TGGCCGCGGG CTTCGCGGTG CCGCAGCTCT TCCTGGCCAC GGCCTTGCTC AACGCGCTGG TCGCCGCGCT GCTGTTCGCC CGCGAGCCCG CCTTCCGCCA GCGCGGCGGC CCCCCGTAG
|
Protein sequence | MSGQFRLLRE RRFLPFFLTQ FLGAFNDNVY KNALVVLITF QAARLGADSP GVLVNLAAGV FILPFFLFSA TAGQLADKYE KSRLIRATKL LEIAVMALAV VGFALMSLPL LLVVLFLMGA QSALFGPVKY AIIPQQLADD ELVGGNALVE AATFVAILAG TIVGGLLVAG DAGPGRVAVA VLAIALLGWW SSRAIPPAAA ADPGLRVNWN PVTQTREMLR FMLEARAVFV AIVGISWFWF YGAVFLSQFP GFAADHLRGD ERAVTLLLAL FSVGIGAGSL LCGRLSRGRV EPRMVLPGAI GLSLFALDLW WASPAAGAFP PGQGLDTLFA RAEVWRVVFD LVMIGVCGGF FIVPLYALVQ QRSAPAHRAR VIAGNNILNA LFMVAAAAMG IGLLAAGFAV PQLFLATALL NALVAALLFA REPAFRQRGG PP
|
| |