Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0545 |
Symbol | |
ID | 7085159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 614108 |
End bp | 615148 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643697572 |
Product | TRAP dicarboxylate transporter, DctP subunit |
Protein accession | YP_002354214 |
Protein GI | 217968980 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.513841 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAACC AGCTCGCAAC CCCCCTCCGC CGCCGCATCC TCGCGACAGG CCTCGCCCTC GCCGCTTTCG CACTCGCCGG ATGCTCGGGC GAACAGAAAG GCACGCAACC CGCGACCACA GCGACGAACC AGAAGCTGGT GATCAAGGTC GGCCACGCCG CCACGGAGTC CAACACCGGC CACAAGGGGC TGGTCGAGTT CAACCGCCTG CTGGGCGAAA AGACGGGCGG CCGCATCAGC CTGGAGATCT ACCCCAACTC GCAGCTCGGC AGCGAACGCG AGCTGATCGA AGCGGTGCAA CTGGGCAGCG TCGGCATGAC CTTCGTGTCC TCGGCCCCGC TGGGCGGCTT CAAGAAGGAG TTCTTCGCCC TCGACCTGCC CTTCGTGTTC AAGGATCGTC CCACCGTCTA CAAGGTGCTC GACGGCGAAC CCGGCCAGTA CCTGCTGAAG AGCCTCCAGG ACATCAACAT CCAGGGCCTC GGCTTCTGGG AGAACGGTTT CCGCCAGCTC AGCAACAGCA AGGTGGCGGT GAAGACGCCG GATGACCTGA AGGGCATCAA GATGCGCACG ATGGAGAACG AGGTCCACCT CGCCGCCTGG AAGGAGCTGG GTGCCAACCC GGCGCCGCTG GCCTTCGGCG AGCTGTTCAC CGCGCTGCAG CAGGGCACCT TCGATGCCCA GGAAACCCCG ATCAACCTGT TCCGCGACAT GAAGTTCTTC GAGGTGCAGA AGTTCATCAC CAAGACCGGC CACCTCTACT CGCCCTTCGT CATCCTGATG AGCAAGCCGC TCTACGACGG TCTGAGCGAG GCCGACAAGC AGGCCATGGC CGAAGCCTTC GAAGCCGCCA AGGCGTACCA GCGCGACCTC GCGCAGAAGA GCGACGCCGA GGCCGAAGCG CAGATGACCG GCATCACCTT CACCGAATTG AGCGACGCCG AGAAGGAAGC CTTCCGCGCC AAGATGGGGC CGGTGTACGG CCTGGTGAAG AAGAAGGCCG GCGAGGAAAT CGTCAAAAAG GTGCTGCAGG CCACCAACTA A
|
Protein sequence | MSNQLATPLR RRILATGLAL AAFALAGCSG EQKGTQPATT ATNQKLVIKV GHAATESNTG HKGLVEFNRL LGEKTGGRIS LEIYPNSQLG SERELIEAVQ LGSVGMTFVS SAPLGGFKKE FFALDLPFVF KDRPTVYKVL DGEPGQYLLK SLQDINIQGL GFWENGFRQL SNSKVAVKTP DDLKGIKMRT MENEVHLAAW KELGANPAPL AFGELFTALQ QGTFDAQETP INLFRDMKFF EVQKFITKTG HLYSPFVILM SKPLYDGLSE ADKQAMAEAF EAAKAYQRDL AQKSDAEAEA QMTGITFTEL SDAEKEAFRA KMGPVYGLVK KKAGEEIVKK VLQATN
|
| |