Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0808 |
Symbol | |
ID | 7084200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 895067 |
End bp | 896314 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643697832 |
Product | aromatic hydrocarbon degradation membrane protein |
Protein accession | YP_002354473 |
Protein GI | 217969239 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG2067] Long-chain fatty acid transport protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.124211 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGATGA ACAAACTTGC GATCAACATC CTTGGCGCAA CGCTCGCACT CGCGGGCGGC AACGCCATGG CTGCGGGTTT CGCACTGCAG AACCAGAACG GTGCGGGCAC GGGGAACGCC TTTGCTGGCG CGGCCGCTGC GGCCGAAGAT GCTTCGACCG TCTATTTCAA TCCGGCAGGG ATGCTGCTGT TGCCCAAGGG GCACAACATT ACCGGTGCCG TCACTTTCCT CGACCGAAGC GTGGAGTTTT CCGATCGTGG AACGGCTCGC TTGGTGCCCT TCGCTCTGGG TAGCGACGGT GGCGATGGCG GCAGTCTCGC GATCGTGCCG GCTGCTTACT GGTCGTATGC AGTCAGTCCC GATCTGGCCG TTGGTCTGGG CGTTGGGCCG ACCTTCGGCA ACAAGACCGA GTTCGACCGG GATTTCATCG GGCGCTTCTC CGGGTATTTT GCCGAGATCA AACAGATCAA CATCAATCCG TCGATCGCCT ACCGCGTGAA TGACATGGTC GCCTTGGGGT TCGGTCTCAA TTTCGCCAAG AACGAAACCG AATTCAAGCA AATGGCGCCC CTTGCCGCAG GCATTCCGAT TGGCGTCCCG GTCACCATCA AGGGAGACGA CACCGCGTTC GGCTGGAATG CTGGCCTGAT GATTCAAGCG ACACCTGCGA CTCGCGTCGG CGTCACTTAT CGCTCGGAAC TCAGTTTCGA TCTGAAAGGG ACTCAGGAGA TTTCCGGCGT TCGCACCTTC GACGTGAAAG CCAAGCTGAA GACTCCCGAT CAGTTCTCGT TTGGCATGCA CCATGCCGTC GATTCGAAAC TCGAGTTGCT CGCCGACCTG ACGTGGACTG GCTGGAGCTC CATCGAATCC ATCAAGGTCC GTGGTGGTAC GAACCCCGAG TTGCCCTATC ACTTTAAGGA CACCTGGCGT GTCGGTCTGG GTGTCGGCTA CCAGATGAAC AACCAGTGGA AGCTCAGGGC TGGCGTCGCA TTCGACGAGG CGCCTGTTCG TTCGGCGGCT GATCGCACGA TGACCCTGCC CGATACGGAT CGCACCTGGC TGGCGCTGGG TGCGCGTTAC ACGCTGAACA AGAACGCATC GATCGACGTC GGTTATGCAC ACATCTTCTT CAAGGAAGGG CCGACCGAGC GCATCGTCTA CAACGGCAGT ACGCCCATCC AGCAGATCAA AGGCAAGTTC GACGTGAGCG CCGATCTCCT TTCTGTTCAG TACAACCACA ATTTCTGA
|
Protein sequence | MQMNKLAINI LGATLALAGG NAMAAGFALQ NQNGAGTGNA FAGAAAAAED ASTVYFNPAG MLLLPKGHNI TGAVTFLDRS VEFSDRGTAR LVPFALGSDG GDGGSLAIVP AAYWSYAVSP DLAVGLGVGP TFGNKTEFDR DFIGRFSGYF AEIKQININP SIAYRVNDMV ALGFGLNFAK NETEFKQMAP LAAGIPIGVP VTIKGDDTAF GWNAGLMIQA TPATRVGVTY RSELSFDLKG TQEISGVRTF DVKAKLKTPD QFSFGMHHAV DSKLELLADL TWTGWSSIES IKVRGGTNPE LPYHFKDTWR VGLGVGYQMN NQWKLRAGVA FDEAPVRSAA DRTMTLPDTD RTWLALGARY TLNKNASIDV GYAHIFFKEG PTERIVYNGS TPIQQIKGKF DVSADLLSVQ YNHNF
|
| |