Gene Information |
Locus tag | GWCH70_2065 |
Symbol | |
ID | 7977300 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 2124397 |
End bp | 2125638 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 644798880 |
Product | peptidase M29 aminopeptidase II |
Protein accession | YP_002950050 |
Protein GI | 239827426 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2309] Leucyl aminopeptidase (aminopeptidase T) |
TIGRFAM ID | |
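
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Every codon except the final stop codon encodes one residue.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values copied from the record for locus tag GWCH70_2065.
gene_length, protein_length = cds_lengths(2124397, 2125638)
print(gene_length, protein_length)  # 1242 413
```

Under these assumptions the computed values agree with the listed gene length (1242 bp) and protein length (413 aa).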
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.610998 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGAGCGACT GGGAACAAAA TTTAGAAAAA TATGCCGCGC TCGCCGTACA AGTCGGCGTC AACGTCCAAA AAGGGCAAAC GCTCTTTGTA AACGCGCCGC TTGTGGCTGC TCCGCTTGTA CGGAAAATCG CGAAAAAAGC ATACGAAGTT GGCGCCAAAC ATGTCTATGT CGAATGGAAT GACGAAGATC TTACATACAT TAAATTCAAG TATGCTCCAG ATGAAGCGTT TTTGGAATAT CCAATGTGGC GTGCGAAAGG AATGGAACAG CTTGCGGAAG AAGGTGCGGC GTTTTTATCC ATCTATTCGC CAAACCCGGA TTTATTGAAA GACATTGATC CGAAACGGAT CGCAACGGCA AATAAAACCG CATCTCAAGC ATTGCGCAAC TATCGCAGTG CCTTAATGGC GGATAAAAAC TGTTGGTCAT TGATCTCCGT TCCTACGCCG GCGTGGGCGA AAAAAATATT TCCAGACCTC AGTGAAGAAG AAGCGATCGA CAAGCTATGG GAAGCGATAT TCCGCATTAC CCGCGTCGAC CAGGACGACC CTATCCAAGC GTGGCAGCAA CATAACGACC GACTCGCCAA AATCGTTGAT TACTTAAACA ATAAACAATA TCAACAACTC ATCTATGAAG CGCCTGGAAC AAACTTAACG ATCGAACTTG TAGAAAACCA TGTATGGCAT GGCGGTGCAG CCGTCAGCGA GAAAGGCGTT CGTTTCAACC CGAACATCCC GACAGAAGAA GTGTTTACGA TGCCGCATAA AGACGGGGTC AATGGCAAGG TACGCAATAC CAAACCGCTC AATTATAACG GCAACCTGAT AGATGGATTT ACCCTTACGT TTAAAGATGG AAAAGTCGTT GACTTCACTG CAGAAAAAGG ATATGAAATA TTAAAGCATT TATTGGACAC GGACGAAGGT GCGCGCCGAT TAGGAGAAGT GGCACTCGTT CCACATCAAT CACCGATTTC CACATCGAAT TTAATTTTCT ATAACACATT ATTCGATGAA AACGCCGCAT GTCACCTAGC GCTCGGAAAA GCATACCCAA CGAACATTCA AAACGGCACC GCTATGTCCA AAGAAGAGCT CGACAAACAT GGGGTCAACG ATAGCCTCAT CCATGAAGAT TTTATGATCG GCTCTGCCGA ACTAAACATC GATGGCGTCA CGAAAGACGG CAAGCGCGAA CCAATTTTCC GCAACGGAAA CTGGGCGTTT GAATGGAAAT AA
Protein sequence | MSDWEQNLEK YAALAVQVGV NVQKGQTLFV NAPLVAAPLV RKIAKKAYEV GAKHVYVEWN DEDLTYIKFK YAPDEAFLEY PMWRAKGMEQ LAEEGAAFLS IYSPNPDLLK DIDPKRIATA NKTASQALRN YRSALMADKN CWSLISVPTP AWAKKIFPDL SEEEAIDKLW EAIFRITRVD QDDPIQAWQQ HNDRLAKIVD YLNNKQYQQL IYEAPGTNLT IELVENHVWH GGAAVSEKGV RFNPNIPTEE VFTMPHKDGV NGKVRNTKPL NYNGNLIDGF TLTFKDGKVV DFTAEKGYEI LKHLLDTDEG ARRLGEVALV PHQSPISTSN LIFYNTLFDE NAACHLALGK AYPTNIQNGT AMSKEELDKH GVNDSLIHED FMIGSAELNI DGVTKDGKRE PIFRNGNWAF EWK
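
The relationship between the gene sequence and the protein sequence can be sketched as a translation under translation table 11 (the bacterial, archaeal and plant plastid code). This is a minimal illustration, not the pipeline that produced the record: the function name `translate_cds` is hypothetical, the codon table is written out in the usual TCAG ordering, and the first codon is assumed to be an initiator that is read as methionine (which is why the leading GTG corresponds to M rather than V).

```python
from itertools import product

# Amino acids for the 64 codons in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Alternative initiation codons of table 11; as a start codon each is read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a CDS given as text; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon, assumed to encode Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGAGCGACT GGGAACAAAA TTTAGAAAAA"))  # MSDWEQNLEK
```

The printed residues match the first block of the protein sequence in the record. Passing the full gene sequence string to the same function would be the natural way to compare against the whole listed protein.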