Gene Information |
Locus tag | Tmz1t_2826 |
Symbol | |
ID | 7873234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3060461 |
End bp | 3061681 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643699747 |
Product | protein of unknown function DUF214 |
Protein accession | YP_002889802 |
Protein GI | 237653488 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4591] ABC-type transport system, involved in lipoprotein release, permease component |
TIGRFAM ID | |
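The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length excludes it; the record does not state either convention. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 3060461
END_BP = 3061681


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1221 406
```

Under these assumptions the computed values agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields.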
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.663211 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTCGT GGATGCCCTT CGAATGGATC GCCGCGCTGC GCTTCCTGCA GGAGGGACGC ATGCAGTCGC TGCTGATCAT CGTCGGCGTC GGCGTCGGCG TGGCGGTGAT CGTGTTCATG TCGGCGCTGC TGTCCGGTCT GCAGGCCAAC CTGGTGCGCC GCACGCTCAG TTCGCAGGCG CACATCGTGC TGCTGCCGGC CGAGGAGGTG GCGCGCCCGC AGGCCTCGAG CGGGCACGAC GCGATCCGCC TGCAGAAGCA GGCGCAGCGG CTGCGCTCGA TCGACCAGTG GCAACTGCTG CGCGACCGCC TCGAGGCCTG GCCCGAGATC GCCGCGGTGT CGCCGGCCGC CTCCGGCCCG GCGTTCGCGG TGCGTGGCGA CGCCAGCAAG GCGGTGACCC TGCTCGGCAT CGAACCCGCG CGCTACCAGC AGGTCATCGA CCTCGGCGGA CGCATCACCG CCGGCGAGCT GCGCGTCGGC GCCGGCGAGG CGGTGATCGG CATCGAGCTG GCCAAGGACC TCGGCGCCGA CGTCGGCGAC AAGCTGCGCA TCCGCGGCGC GCAGGGCGAG GCCGAGACGC TGACCGTGAC CGGGCTGTTC GATCTCGGCA ACAAGGGCGT CAATGCACGC AACGTCTATG TCGGCCTGCG CACCGGTCAG ACCCTGCTCG ACCTGGTCGG CGGCGTCTCC AACATCGACC TCGCCCTCCA CGACCTCGAC CTCGCGGAGG ACGTGGCGCA GCGCATCGCC GCCGAGTCCG GCCTGATCGC CGACAGCTGG ATCCGCACCA ACGCGCAGTT CGTCACCGCG CTCACCTCGC AGCGGGTGTC GAGCAACGTG ATCCGCTTCT TCATCGCGCT CTCGGTGTCC TTCGGCATCG CCAGCGTGCT GGTGGTGTCG GTGGTCCAGC GCAGCAAGGA GATCGGCATC CTGCGGGCGA TGGGCGCCAC GCAGGCGCAG ATGCGGCGCA TCTTCCTGCT GCAGGGCGGC ATCGTGGGTT TCCTCGGATC CTTCCTGGGC TCGGCGCTGG CGTGGGCCTT CCTGATGCTG TGGCGGATGC TCGCGCGCAA CCCGGACGGC ACGCCGCTGT TCGACATCGG CGTCGAGCCC GCGCTGGTGG CGATCGCCGC GGGCGGCGCC AGCGTGGTCG GCATCCTCGC GGCGCTGCTG CCGGCACGGC GTGCGGCAGG GCTCGACCCG GTGGTGGCGA TCCGTGGCTG A
|
Protein sequence | MKSWMPFEWI AALRFLQEGR MQSLLIIVGV GVGVAVIVFM SALLSGLQAN LVRRTLSSQA HIVLLPAEEV ARPQASSGHD AIRLQKQAQR LRSIDQWQLL RDRLEAWPEI AAVSPAASGP AFAVRGDASK AVTLLGIEPA RYQQVIDLGG RITAGELRVG AGEAVIGIEL AKDLGADVGD KLRIRGAQGE AETLTVTGLF DLGNKGVNAR NVYVGLRTGQ TLLDLVGGVS NIDLALHDLD LAEDVAQRIA AESGLIADSW IRTNAQFVTA LTSQRVSSNV IRFFIALSVS FGIASVLVVS VVQRSKEIGI LRAMGATQAQ MRRIFLLQGG IVGFLGSFLG SALAWAFLML WRMLARNPDG TPLFDIGVEP ALVAIAAGGA SVVGILAALL PARRAAGLDP VVAIRG
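The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any downstream use. The following is a minimal sketch of that step, plus a GC percentage calculation that could be compared with the GC content field. The helper names `normalize` and `gc_percent` are hypothetical, and how the database rounds its reported value is not known from the record.

```python
def normalize(seq: str) -> str:
    """Strip the whitespace used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace ignored)."""
    s = normalize(dna)
    if not s:
        raise ValueError("empty sequence")
    gc = sum(1 for base in s if base in "GC")
    return 100.0 * gc / len(s)


# Illustration on the first two blocks of the gene sequence only;
# pass the full "Gene sequence" field to check the whole gene.
first_blocks = "ATGAAGTCGT GGATGCCCTT"
print(len(normalize(first_blocks)), gc_percent(first_blocks))
```

Applying `len(normalize(...))` to the full gene and protein fields gives lengths that can be compared with the Gene Length and Protein Length entries.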