Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_2745 |
Symbol | |
ID | 7979148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2782652 |
End bp | 2783728 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 644799542 |
Product | Glutamyl aminopeptidase |
Protein accession | YP_002950701 |
Protein GI | 239828077 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000000840882 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGTAG AAACGCTTCG ACTGTTTCAA ACATTGACCG AATTGCCGGG AGCACCAGGC AATGAACATG CGGTGCGGAA TTTCATGCGC AAAGAGCTGG AAAAGTACGC GGATGAAGTC GTACAAGACC GGCTTGGCAG CATCTTTGGC GTCAAACGCG GCGATGAAAA CGGTCCTACC GTGATGGTTG CAGGGCATAT GGACGAAGTC GGCTTTATGG TCACCGCTAT TACCAATAAC GGTATGATTC GTTTTCAGCC GCTTGGCGGC TGGTGGAATC AAGTATTGTT AGCACAGCGC GTACAAATCA TTACCGATCA TGGTCCAGTT GTTGGAGTGA TCAGCTCGAT TCCGCCGCAT TTGTTGAGCG AAGAACAACG AAACAAGCCG ATGGAGATCA AAAACATGCT CATCGACGTT GGTGCTGATG ACCGTGAAGA CGCGAAAAAA ATGGGGATTA AACCAGGACA ACAAATTGTA CCGATTTGTC CATTTACCCC AATGGCCAAT CCGAAAAAAA TTTTGGCAAA AGCGTGGGAC AATCGTTATG GCTGTGGATT GGCGATTGAA TTGTTGAAAG AGTTGAAGGA TGAGAAACTG CCAAACGTGC TATATTCAGG TGCTACTGTC CAAGAAGAAG TCGGGTTGCG CGGGGCGCAA ACCGCCGCAA CCATGATTCA GCCTGATATC TTTTTCGCGT TAGACGCAAG CCCGGCGAAC GATATGACCG GAGACGCGAA AGAATTTGGG CATCTTGGAA AAGGGGCGCT TGTCCGCATT TATGACCGTT CGATGGTGAC TCATCGCGGT ATGCGTGAAT TTGTATTAGA TACGGCGGAA ACGCACGGCA TTCCATATCA ATTTTTCGTT TCACCGGGTG GAGGAACGGA TGCCGGAAGA GTGCATATCG CCAACAGTGG AGTGCCTTCT GCCGTGATCG GTATTTGTTC CCGTTATATT CATACACATG CGTCGATCAT TCACGTCGAT GATTATCAGG CAGCAAAGCA ACTGCTTATT GAACTTGTAA AGCGGTGCGA TAAAGCAACA GTCGATGCAA TTAAGAAAAA CAGCTAA
|
Protein sequence | MNVETLRLFQ TLTELPGAPG NEHAVRNFMR KELEKYADEV VQDRLGSIFG VKRGDENGPT VMVAGHMDEV GFMVTAITNN GMIRFQPLGG WWNQVLLAQR VQIITDHGPV VGVISSIPPH LLSEEQRNKP MEIKNMLIDV GADDREDAKK MGIKPGQQIV PICPFTPMAN PKKILAKAWD NRYGCGLAIE LLKELKDEKL PNVLYSGATV QEEVGLRGAQ TAATMIQPDI FFALDASPAN DMTGDAKEFG HLGKGALVRI YDRSMVTHRG MREFVLDTAE THGIPYQFFV SPGGGTDAGR VHIANSGVPS AVIGICSRYI HTHASIIHVD DYQAAKQLLI ELVKRCDKAT VDAIKKNS
|
| |