Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0498 |
Symbol | |
ID | 7085009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 559939 |
End bp | 560970 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643697527 |
Product | TRAP dicarboxylate transporter, DctP subunit |
Protein accession | YP_002354169 |
Protein GI | 217968935 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.962606 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGAAAC GCCGCTTCAC CGCCCTCATC GCCGGCCTGT TCGCCTCGAC CGCGCTCGGT TTCTCGATGC CGGCCACGGC CCAGCAGTAC AAGGACGAGT ACAAGCTCTC CACGGTGCTC GGCGAGGCCT TCCCGTGGGG CTGGGGCGCC AAGCGCTGGG CCGACCTGGT CGCAGAGAAG ACCGAAGGTC GCATCAAGAT CAAGGTGTAT CCGGGCACTT CGCTGGTGTC GGGCGACCAG ACCAAGGAAT TCACCGCGCT GCGCCAGGGC ATCATCGACA TGGCGGTCGG TTCCACGATC AACTGGTCGC CGCAGGTCAA GGAGCTCAAC CTGTTCGCGC TGCCCTTCCT GATGCCCGAC CACAAGGCGA TCGACGCGCT CACCCAGGGC CGCGTCGGCA AGAAGATGTT CGACATCCTC GCCGAGCGCG ACGTGGTGCC GCTGGCCTGG GGCGAGAACG GTTTCCGCGA GGTCTCCAAC TCGAAGAAGC CGATCCGCAC GCCCGAGGAC GTCAAGGGCA TGAAGATGCG CGTGGTCGGT TCCCCGCTCT TCCTCGCCAC CTTCAACGCG CTCGGCGCCA ACCCGACGCA GATGAGCTGG GCCGACGCCC AGCCGGCGAT GGCGACCGGC GCGGTCGACG GCCAGGAGAA CCCGCTCGCG GTGTTCAACG CCGCCAAGCT GCACACCGTG GGGCAGAAGA ACCTGACCCT GTGGGGCTAC GTCGCCGACC CGCTGATCTT CGTGGTGAAC AAGTCCGTGT GGAACTCGTG GTCCGAGGCC GACCGCAAGG CCGTGTCCGA GGCCGCGCAG CAGGCTGCGA AGGAAGAGAT CGCGCGCGCG CGCGCCGGCA TCTCGGCGGC CGACGACGCG CTGCTGAAGG AGATCGAGGC CAACGGCGTG GCCGTGGTGC GCCTGACCGA TGCCGAGCGC GACGCCTTCC GCCAGGCCAC CGCGGGCGTG TACAAGGAGT GGGCCGAGAA GATCGGCGCC GACCTCGTCA AGCAGGCCGA GGAAGACATC GCCAAGCGCT GA
|
Protein sequence | MQKRRFTALI AGLFASTALG FSMPATAQQY KDEYKLSTVL GEAFPWGWGA KRWADLVAEK TEGRIKIKVY PGTSLVSGDQ TKEFTALRQG IIDMAVGSTI NWSPQVKELN LFALPFLMPD HKAIDALTQG RVGKKMFDIL AERDVVPLAW GENGFREVSN SKKPIRTPED VKGMKMRVVG SPLFLATFNA LGANPTQMSW ADAQPAMATG AVDGQENPLA VFNAAKLHTV GQKNLTLWGY VADPLIFVVN KSVWNSWSEA DRKAVSEAAQ QAAKEEIARA RAGISAADDA LLKEIEANGV AVVRLTDAER DAFRQATAGV YKEWAEKIGA DLVKQAEEDI AKR
|
| |