Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_1703 |
Symbol | |
ID | 7084123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 1911356 |
End bp | 1913509 |
Gene Length | 2154 bp |
Protein Length | 717 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643698724 |
Product | peptidase M1, membrane alanine aminopeptidase |
Protein accession | YP_002355354 |
Protein GI | 217970120 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.023595 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCGCG GGCTGTTCGG GTGGCTGCTG CTTGCCGGCT GGCTGGGCGG GCTTCCTTCC GCCCTGGCCG CGGCCGGGCC GGCCAGGCTG CATCTGGCGG TCGAGCTCGA TCCGGCGCGC GGCGAGCTTC AGGTCGAGGC CGTGCTGCGT GCGCCCGAAG CCGGCTACCG CTTCGTGTTG CACGAATCCC TGCAGCCGCA GGCCGCGAGC GCCGACGGGC ATGCGCTGCC GCTGCGTGCC GCGGGCAGCC GCGGTACGTT GCGCGCCTGG CGGGTGGATG CGCCGGCCGG CGCCGAGCTG CGCCTGCGCT ACGGCGGCAA GCTGCCCGCG CTCGAGCAGG CACGCGACCA CCGCGAGGTG CTGCAGGCCA TGCCGCCGAT GGCCTCGCCA GCGGGCAGCT TCCTCGCCAG TGGCAGCGGC TGGTATCCGC GTCCGGCCGA CCTCTTCGCC TACGAGGTGG ACCTCGTGGT CCGTGGCGGC CAGCCGGCGC TGGTCGCCGG ACGACTGGCG CACGAGCAGC GCCCTGCCGC GGCGGGCGAG CCCTACCGCG CGCGCTTCGT CTTCGAACAT CCGGCAGACG GCATCGACCT GATGGCCGGG CCCTGGGTGG TGCGCGAACG CCTCGCCACC CAGGCCGACG GCCGGCCGCT GCGCCTGCGC ACCTACTTTC CCGCCGCGCT CGACGGCGAG GCCGGGCTGG CCGAGGCCTA CCTCGCCGAC AGCCAGCGCT ACATCGAGCG CTACTCGGCA CAGATCGGCG CCTACCCCTT CACCGAGTTC TCGGTGGTGG CGAGCCCGCT GCCGACCGGC TTCGGCATGC CCACGCTGAC CTACATCGGC GAGCAGGTGC TGCGCCTGCC CTTCATCCGC GCGAGCTCGC TCGGCCACGA GGTGCTGCAC AACTGGTGGG GCAACGGCGT GCTGGTCGAC TACGCGCGTG GCAACTGGTC GGAGGGGCTG ACCACCTTCA TGGCCGACTA CGCCTACAAG GCCGAGGCGT CGGCCGCCGC GGCGCGCGAG ATGCGCCTGG GCTGGCTGCG CGACTTCGCC GCACTGCCGG CGGATTCGCA CGCACCGCTC GCCGATTTCC GCTCGCGCAC CCACGGCGCC GCGGCCGCGG TGGGTTACGG CAAGGCGGCG ATGGTCTTCG TGATGCTGCA GGACGAAATC GGCGAGGACG CGTTCGCGCG CGGCATCCGC CTTTTCTGGG AACGCCAGCG CTTCCGTATC GCCGCTTGGG ACGAACTGCG CGCCGCCTTC GAGGAGGCAG CCGGGCGCCC GCTGCAGGGC TTCTTCCGCC AGTGGCTGGA GCTGCCCGGC GGGCCGGCGC CGCGCATCGA ACGGGCACGC TTGGTGGAGC AGGCCGGGCA GGGCGCGCGC GTGCGCGTGA GCCTGGCGCA ACCGGGACCG GTGCACGCCT TGCGGCTGCC ACTGCAGCTC GTGGCGGGGG ATCGGGTAGA GACCCGGGTC ATCGAGTTCG CCGCGGGCCG TACCGATGTG GAGCTGACCG CCGGCTTCGT GCCCGAGGGC GTGGCCCTCG ACCCGGAGCT GCGTCTGTGG CGCCTGCCCG AGGCCGCGCA ACTGCCGCCC ATCCTGCGCC AGTGGATCGT CGCCGCCGCG CCGCGGCTGG TGCTCGCCGA CGCACAGGCG GAGGCGGGCG GGGCGGGGAG CGGCGCAGTG AGTGGCGCAG TGAGTGGCGC AGGGAGTGGC GCAGGGAGTG GCGCAGGGAG TGGCGCAAGC GGTGGCACGC CGAGAGGGGC GGACAGTGCA TTCGCCGAGG CGGCGCAGGC GCTCGCCGCC CGCCTGTTCG AGCGCACGCC GACGCTGGCA GGTCTCCCAG CCTTGGGCGC AGAGGGCGGG CCGGTGCTGC TGGTGGGCAG CGAGCCCGCG GTGGCCGCGG TGCTCGTCGG TGCCGGCCTG GCGGCACAGC CCCGGGGCAT GCCCGCAGGC GGCAGCGCCA GGGTGTGGAC CGTGCAGCGT GAGCGCGGCC CGGCGCTCGC CGTGATCGCC GCCGCCGACC CCGCCGCGCT GCGCGCGCTG CAGCGCGCGT TGCCACACTA CGGCAGCCAG AGCTGGCTGG TCTTCGACGG CACGCGGGCA AGCGCCCGCG GCGTATGGGA CGCGCCTGGC AATGTGGTGA AGGTGGAGCC TTGA
|
Protein sequence | MLRGLFGWLL LAGWLGGLPS ALAAAGPARL HLAVELDPAR GELQVEAVLR APEAGYRFVL HESLQPQAAS ADGHALPLRA AGSRGTLRAW RVDAPAGAEL RLRYGGKLPA LEQARDHREV LQAMPPMASP AGSFLASGSG WYPRPADLFA YEVDLVVRGG QPALVAGRLA HEQRPAAAGE PYRARFVFEH PADGIDLMAG PWVVRERLAT QADGRPLRLR TYFPAALDGE AGLAEAYLAD SQRYIERYSA QIGAYPFTEF SVVASPLPTG FGMPTLTYIG EQVLRLPFIR ASSLGHEVLH NWWGNGVLVD YARGNWSEGL TTFMADYAYK AEASAAAARE MRLGWLRDFA ALPADSHAPL ADFRSRTHGA AAAVGYGKAA MVFVMLQDEI GEDAFARGIR LFWERQRFRI AAWDELRAAF EEAAGRPLQG FFRQWLELPG GPAPRIERAR LVEQAGQGAR VRVSLAQPGP VHALRLPLQL VAGDRVETRV IEFAAGRTDV ELTAGFVPEG VALDPELRLW RLPEAAQLPP ILRQWIVAAA PRLVLADAQA EAGGAGSGAV SGAVSGAGSG AGSGAGSGAS GGTPRGADSA FAEAAQALAA RLFERTPTLA GLPALGAEGG PVLLVGSEPA VAAVLVGAGL AAQPRGMPAG GSARVWTVQR ERGPALAVIA AADPAALRAL QRALPHYGSQ SWLVFDGTRA SARGVWDAPG NVVKVEP
|
| |