Gene Information |
Locus tag | Tmz1t_2275 |
Symbol | |
ID | 7083707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 2559156 |
End bp | 2560073 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643699294 |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_002355910 |
Protein GI | 217970676 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component |
TIGRFAM ID | |
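
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the gene length includes the terminal stop codon. The variable names are illustrative and are not fields of the database.

```python
# Values copied from the record above; variable names are illustrative only.
start_bp = 2559156
end_bp = 2560073

# Assumption: coordinates are inclusive on both ends, hence the +1.
gene_length = end_bp - start_bp + 1

# For an unspliced CDS, every codon except the final stop codon encodes a residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 918 0 305
```

Under these assumptions the result matches the listed 918 bp and 305 aa, and the zero remainder indicates the CDS is a whole number of codons.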
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAACC GTGCCCACGC GCTTGCCGCC GGGCTGTTCG CAATCTTGCT CGGTGCCGGC GTCGTGTTCG CGATCTGGTG GTTTTCGGGC CAGCGCGTGC CGATGCGCGA GATCGTGCTC GAGGCACGCG GCGACATCAA CGGCCTGGGC GCACAGTCGC GCGTGCGCTA CCGCGGCATG GCGGTGGGGA GCGTGCGCGC CGTCGCGATC GACCCCGAGG ACCTGCGCAC GCTGCTGGTG CGCATCGCGG TGCCCGCCGA TCTGCCCCTG ACCCGCGGCA CCACCGCCGC GCTCGGCACG CTGGGCGTGA CCGGCCTGGC CTTCGTGCAG CTCGACGATC GCGGGGTGGA CCGCCGGCCC TTGGCTGCCG CCGATGGCGA GGTGCCGCGC ATCGCCTTGC AACCCGGGCT GGTGGAGGCG CTCTCCGGGC GGGCGCTCGC CGCCCTCGAC CAGTTCCAGG CGCTCGGCGA GCGCCTGCAG GGGGCGCTGG CCAGCCTGGA GTCAGCGGCC GCGGGCGTCG ATCGCGGGGT GGCGGAACTG CCGGCGACCC TGGCCGCGCT GCGGGCCGCC TTGAGTGCCG GGAACATCGC GCGCATCACC TCCGTGCTCG GGAACCTCGA GCGCGGCTCG GCCGAAGCCG TGCCCGCCGT TGCCGAGCTG CGCCGCCTGA TCGTCCGCAT CGATCAGGCG GCCGCGCGTC TGGAGCTCCG CGCCGCCGCC GCAGGGGACG ACCTCGTCGA GCGCACGCTG CCGCAGCTCG ATGTCCTGCT CGGTGAGCTC ACGAGCACCT CGCAACGCTT CGGCCACCTG GTCGAGGAGC TCGACGCCTC GCCGCAGCTC CTGCTGACCG GGCGCGACCG TCCTCTGCCC GGGCCCGGTG AGGCGGGCCA CGAGGGCTTT GCGGGGAGCG ACCGATGA
Protein sequence | MENRAHALAA GLFAILLGAG VVFAIWWFSG QRVPMREIVL EARGDINGLG AQSRVRYRGM AVGSVRAVAI DPEDLRTLLV RIAVPADLPL TRGTTAALGT LGVTGLAFVQ LDDRGVDRRP LAAADGEVPR IALQPGLVEA LSGRALAALD QFQALGERLQ GALASLESAA AGVDRGVAEL PATLAALRAA LSAGNIARIT SVLGNLERGS AEAVPAVAEL RRLIVRIDQA AARLELRAAA AGDDLVERTL PQLDVLLGEL TSTSQRFGHL VEELDASPQL LLTGRDRPLP GPGEAGHEGF AGSDR
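
The relationship between the gene sequence and the protein sequence can be spot-checked with a short translation sketch. The gene is on the minus strand, but the listing already starts with ATG and ends with TGA, so it appears to be given in coding orientation and no reverse complement is applied here. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and written for illustration. Only the first three and last two blocks of the gene sequence are used, to keep the example short. Translation table 11 assigns the same residues to codons as the standard code and differs in which start codons are permitted, so a standard codon table is used.

```python
from itertools import product

# Standard codon assignments in TCAG order (first base varies slowest).
# Translation table 11 shares this codon-to-residue mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Drop the spaces that group the listing into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three and last two blocks of the gene sequence listed above.
gene_prefix = "ATGGAAAACC GTGCCCACGC GCTTGCCGCC"
gene_suffix = "GCGGGGAGCG ACCGATGA"

print(translate(gene_prefix))  # MENRAHALAA
print(translate(gene_suffix))  # AGSDR
```

Both results agree with the first and last blocks of the listed protein sequence. Passing the full 918-base listing to `gc_fraction` gives the value to compare with the GC content field.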