Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_2787 |
Symbol | |
ID | 7873196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 3016349 |
End bp | 3017848 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643699709 |
Product | leucyl aminopeptidase |
Protein accession | YP_002889764 |
Protein GI | 237653450 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.151843 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATTTA CCATAAAGAC CGGCAGCCCG GCCAAGCTCA AGACTGGCCT GCTGGTACTC GGCGCGTTTG CCGAAGGCCG TCTCCCCGCC CTCTCGGCCG CGGCCGACGG TGCTGCCGAA GGCCGCCTGG CCGCCCTGAT CAAGCGTGGC GACCTCGAGG ACAAGGCGGG CGCCACGCTG CTGGTGCATG ACCTGCCCGG CGTGCACGCC GAGCGCGTGC TGCTGGTGAG CCTGGGCAAG CACGACGAGT TCGGCGACAA GGCGTACCGC GACGCGCTCG CCGCCGCGGC GAAGGCGATC TCCGCCGGGC CGGCGAAGGA CGCCGTGGTC GCGCTCGCCG ACGCAGAGCT CTCCGGTCGC GACATGGCGT GGCGGCTGCA GCAGGCCGCG CGCATCCTCG CCGACGGCGC CTACCGCTTC GACGCACTCA AGTCCGACAA GAAGACCCGG AAGGAGCGTG GCGCGAAGAA GCTCTGCCTG CTGGTGAGCT GCGAGCTCGG CGCCGAGCTC GACACCGCCG TCCTGCAGGG CCACGCGATC GCCAGCGGCA TGGCGCTGGC GAAGGACCTC GGCAACCTGC CCGGCAACCA CTGCACTCCC ACCCACCTGG CGGAAACGGC CGAGTCCCTG GGCAAGCAGT ACAAGTTCGA CGTCGAGGTG CTCGAGCGCG ACGACATGGA GAAGCTCGGC ATGGGCTCCT TCCTGTCCGT GGCGCGCGGC TCGCACCAGC CGCCCAAGTT CATTGTCATG CACTACAAGG GCGGCAAGGC CAAGGCGAAG CCGGTCGTGC TCGTCGGCAA GGGCATCACC TTCGACACCG GCGGCATCTC GCTCAAACCC GCAGCCGAGA TGGACGAGAT GAAGTTCGAC ATGTGCGGCG CCGCGAGCGT GCTCGGCACC TTCAAGGCGG TCGCGCAGAT GGGCCTGCCG ATCAACCTGG TCGGCCTCGT CCCCACGACC GAGAACATGC CCGGCGGCGG CGCCACCAAG CCCGGCGACG TCGTCACTTC GATGTCGGGG CAGACCATCG AGGTGCTCAA CACCGACGCC GAGGGCCGCC TGATCCTGTG CGACGCGCTC ACCTACGCCG AGCGCTTCAA GCCCGAGTGC GTGATCGACA TCGCCACGCT CACCGGCGCC TGCGTGGTCG CGCTCGGCAA GATCCCGAGC GGACTGCTCG CCAACGACGA CGAGCTCGCC GCCGAGATCC TGCGCCGTGG CACCGAGTCG GGTGACCGCG CCTGGCAGCT GCCGCTGTGG GACGAATACC AGGAACTGCT CAAGAGCAAC TTCGCCGACA TGGGCAACAT CGGCGGCCGT TACGCCGGCA CGATCACCGC GGCCTGCTTC CTGTCGCGCT TCGCCAAGGC CTACAAGTGG GCCCACCTCG ACATCGCCGG CACCGCCTGG GTGTCGGGCG ATGCCAAGGG CGCGACCGGC CGCCCGGTGC CGCTGCTGAG CAGCTTCCTG ATCGGGCGCG CCCGCGCCCA TGCGGCCTGA
|
Protein sequence | MEFTIKTGSP AKLKTGLLVL GAFAEGRLPA LSAAADGAAE GRLAALIKRG DLEDKAGATL LVHDLPGVHA ERVLLVSLGK HDEFGDKAYR DALAAAAKAI SAGPAKDAVV ALADAELSGR DMAWRLQQAA RILADGAYRF DALKSDKKTR KERGAKKLCL LVSCELGAEL DTAVLQGHAI ASGMALAKDL GNLPGNHCTP THLAETAESL GKQYKFDVEV LERDDMEKLG MGSFLSVARG SHQPPKFIVM HYKGGKAKAK PVVLVGKGIT FDTGGISLKP AAEMDEMKFD MCGAASVLGT FKAVAQMGLP INLVGLVPTT ENMPGGGATK PGDVVTSMSG QTIEVLNTDA EGRLILCDAL TYAERFKPEC VIDIATLTGA CVVALGKIPS GLLANDDELA AEILRRGTES GDRAWQLPLW DEYQELLKSN FADMGNIGGR YAGTITAACF LSRFAKAYKW AHLDIAGTAW VSGDAKGATG RPVPLLSSFL IGRARAHAA
|
| |