Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0268 |
Symbol | |
ID | 7085569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 303232 |
End bp | 304323 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643697309 |
Product | protein of unknown function UPF0118 |
Protein accession | YP_002353957 |
Protein GI | 217968723 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGATC AGGTTGCCGA GAAGATTGTC CGCAGAGTCA TCCTCGGCTT CCTGCTCGGC GGTCTGTTGG TCCTCAGCTA TGCCGTGCTG CACGTTTTCA TCGTACCCGT GGCCTGGGCG ATCATCATTG CATTTGCGAC CTGGCCGCTT TATCGGAAGC TGCGCGCGCG CCTGAGGCGT TACCCGACGG TCAGCGCGCT GCTGATGACC TTGCTGCTCA GCGCTGCATT CATTCTGCCC GCGCTATGGA TGGGCGCGCT GCTGCGAACG GAAGTCGGTG TGGCCATTGC CACGGTCACG GCGCAGATCA AGGCGGGCTC CCTTGCGCTC CCGGATTTCA TTCGCTCGAT GCCGTGGGTG GGCGATTGGC TGCAGTCGCT CCTCGACAGC CTTACCGGTG ACCCCGACGC CTTCAGGGCA CAACTCACCG AGTGGGTGCG GCAGGGCAGC GACCAAGCGG TCGCACTCAT CGGCGATGTG GGGCGGAACG CCGCCAAGCT CGGCTTTGCG CTGATCACCG TGTTTTTCCT GTACCGCGAT GGCGACCGTG TGCTGGCCCA GGTCCAGGTG GTGCTGCACC GGTTTCTGGG CGAGCGGGTC GACGCCTACC TCGCCGCTGT CGGTGGCATG ACGAAAGCGG TGGTCTGGGG GCTGATTGCG ACCGCGCTGG GGCAAGGCCT GGTGGCTGGG CTGGGTTATT GGTGGGCGGG CCTTCCCGCG CCGGTGCTGC TCGGGGCGGT GACCGCCTTG ATTGCGATGA TCCCCTTCGG CACTCCATTC GCGTGGGGCT CCCTGGGCGT CTGGTTGCTC GTCAGTGGCG ATACGGCCGC CGGCATCGGC CTGCTGCTAT GGGGAACCTT GGTCGTGAGC TGGGTCGACA ACCTCATTCG CCCCCTTGTC ATCAGCAACG CAACCCAGAT ACCGTTCCTG CTCGTGATGT TCGGGGTCCT CGGTGGCTTG GCGGCATTTG GCCTGGTCGG CCTCTTCCTG GGGCCGGTGG TCTTGGCAGT CCTGATGGCG GTATGGCGGG AATGGATCGA GGAGTCCGAC CTAGTCAGCC TCAATCATGC TTCATCCACA AGCAAGAGTT GA
|
Protein sequence | MIDQVAEKIV RRVILGFLLG GLLVLSYAVL HVFIVPVAWA IIIAFATWPL YRKLRARLRR YPTVSALLMT LLLSAAFILP ALWMGALLRT EVGVAIATVT AQIKAGSLAL PDFIRSMPWV GDWLQSLLDS LTGDPDAFRA QLTEWVRQGS DQAVALIGDV GRNAAKLGFA LITVFFLYRD GDRVLAQVQV VLHRFLGERV DAYLAAVGGM TKAVVWGLIA TALGQGLVAG LGYWWAGLPA PVLLGAVTAL IAMIPFGTPF AWGSLGVWLL VSGDTAAGIG LLLWGTLVVS WVDNLIRPLV ISNATQIPFL LVMFGVLGGL AAFGLVGLFL GPVVLAVLMA VWREWIEESD LVSLNHASST SKS
|
| |