Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3525 |
Symbol | |
ID | 7873031 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3861998 |
End bp | 3863413 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643700466 |
Product | RND efflux system, outer membrane lipoprotein, NodT family |
Protein accession | YP_002890496 |
Protein GI | 237654182 |
COG category | [M] Cell wall/membrane/envelope biogenesis [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1538] Outer membrane protein |
TIGRFAM ID | [TIGR01845] efflux transporter, outer membrane factor (OMF) lipoprotein, NodT family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAATC CATCCGGTTT CCCCCTGCGC ACCCGTCTGC TCCCTGCCCT GCTCGCCGCC CTGACCGCCG GCTGCGCGCT CACCGAGCCC GTCGTGCGTC CCGAGCAAGC GCTGCCGGCG CAGTGGGCGG AACCCGCCCG GGCGGCAGAA CCCGCCACAC CGCTGCATGA CACCTGGTGG CAGGACTTCG GCTCGGCCCG GCTCGACGCC TTCGTCACCG AAGCGCTGGC CACCACCCCC GACCTGCGCA TCCAGGCCGA ACGCGTGGTC CAGGCCGAGC TTGCGCTGCG CCAGGCCGGG GCGTCGCTAT TGCCGTCACT CGACGTGAGC GGCGGGAGCA GCACACGCAA TGTCGACAGC AATGAATCCA GCTCGACCGG CATCGACCTC GGCGCCAGCT ACGAGATCGA CCTCTGGGGC CGGATCGCCG CCGGCGTCGG CGCCAGCCGC GCCGGTCTCA TGGCCACCCG CTTCGACTAT GACGCGGCCC GCCTGTCGAT CAGCGCCAGC GTCGCCATCG CCTGGTTCCA GGTCCTGGCG CTGCAGGAGC GCCTGGACAT CGCGCGCCGG AACCTCGCCA CCGCCGAGCG CGTGCTGCGC GTGGTGCAGG CGCGCTACGA CAACGGTGCG GCCTCGGCGC TCGATCTGAG TCAGCAGCGC ACCACGGTGC TCAACCAGCG CAAGGCCATC GAGCCGCTCG AGGTGCAACT GCGCCAGACG CGCAGCGCGC TCGCAATCCT GCTTGGCCGC AACCCGCAGG CCGGACCCAC CCCCGACGGC GCTGGCGGCA TCGAGCGCCT GGGGGCCCTG AAGGTGCCCG CGGTCGGCGC CGGCTTGCCG TCCGAGCTGC TGCTGCGCCG CCCCGACCTC GCCGCCAGCG AAGCCCGGCT CGTCGCCGCG GCCGCCAACA TCGCCGCCGC GCGCGCCGCG CTGCTGCCCG GGATCAGCCT GTCGGCCGGT GCGGGCGTGG GCAGCGCCGC GCTGCTGGCG CTGGCCGACA CCACACGCAC GCTGTCGATC TCGGCCAGCG TGCTGCAGAA GATCTTCGAC GGCGGCCGCC TGCGCGCGGA CGTCGACATC CAGCGCTCGC GCCAGCGTGA GCTGGTCGAA TCCCACCGCC GCGCGATCCT CGCCGCCCTC AAGGAAGTCG AGGACGTCCT CGCCAACGGC GTCCGCGACA CCAACCAGGA GGCCGCCGAG CGCGAGATCC TCGCCGAAGC CGAGCGCAGC CTGCGCCTGG CCGAACTGCG TTACCGCGAA GGCGCGGACG GCCTGCTCAC GGTACTCCTC GCCCAGCGCA CCCTGTTCGC GTCGCAGGAC CAACTCGCCC TCACGCGCCT GGCCCGACTC ACCGCTGCAG TGAATCTGTA CAAGGCGCTC GGGGGCGGGT GGAGCGCGGG GCAGGCGGGC AACTGA
|
Protein sequence | MTNPSGFPLR TRLLPALLAA LTAGCALTEP VVRPEQALPA QWAEPARAAE PATPLHDTWW QDFGSARLDA FVTEALATTP DLRIQAERVV QAELALRQAG ASLLPSLDVS GGSSTRNVDS NESSSTGIDL GASYEIDLWG RIAAGVGASR AGLMATRFDY DAARLSISAS VAIAWFQVLA LQERLDIARR NLATAERVLR VVQARYDNGA ASALDLSQQR TTVLNQRKAI EPLEVQLRQT RSALAILLGR NPQAGPTPDG AGGIERLGAL KVPAVGAGLP SELLLRRPDL AASEARLVAA AANIAAARAA LLPGISLSAG AGVGSAALLA LADTTRTLSI SASVLQKIFD GGRLRADVDI QRSRQRELVE SHRRAILAAL KEVEDVLANG VRDTNQEAAE REILAEAERS LRLAELRYRE GADGLLTVLL AQRTLFASQD QLALTRLARL TAAVNLYKAL GGGWSAGQAG N
|
| |