Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1949 |
Symbol | |
ID | 7978773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2008180 |
End bp | 2009166 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644798777 |
Product | urea amidolyase related protein |
Protein accession | YP_002949947 |
Protein GI | 239827323 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2 |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000276448 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCACGA TTCAAGTAAT ACATAGCGGA TTTTTCACAA CCGTGCAGGA CGGAGGACGC TTCGGATATC AAAAAGCCGG TGTATCCGTT GGAGGAGTGA TGGATTCATT TGCGAGCCGG ATTGCGAATT TGCTTGTCGA AAATGACGCG AATGAGGCGA CATTAGAAAT TACGATGAAC GGCCCGACGC TTCGCTTTGA AACGGATGCG CTCGTTGCAA TTTGCGGCGG CGTGTTTCGC TGTGCATTAA ACGGCGAACC AATATCGATG TGGAAGCCGC TTGTCATCCG ACGCGATGAT GTGTTATCGA TCGGGGCGTG TCAAGGCGGC TATCGCGCAT ATATCGCGTT TGCGGGCGGG CTTAACATTC CGATTGTGAT GAACAGCCGT TCTACCCATG TACAAGCGCA CATCGGCGGT TTTCATGGAA GAGCGCTGCA GCCGGGGGAT GTATTGTCCC TGAGATCCAA GACGATAACG ATTCCAAAAA ATCCTATTCG TTGGGGGATT GCGTTTTCTG CGAGAAATTA CATAAAAGGG AAAAGAAAAA TGATACGTGT CGTGAAAGGC CCGGAATATA ACATGTTTAC CGAGCAAAGC CTAGAGCGAT TTTTTTCTTC CGCGTATGAA GTGACGACGC AATCGGACCG CATGGGCTAT CGCTTGCAAG GGCAAGCTCT TGAGCGCAAG ACGAATCAAG AAATGATTTC GGAAGCGGTC ACGTTTGGAA CGGTGCAAGT GCCTGCTTCC GGCCAGCCGA TTGTATTGAT GGCCGATTGC CAGCCGACAG GAGGGTATCC GCGCATCGCT CAAGTAATTA GCGTAGACCT TCCGATTTTA GCGCAGGCGC GCCCAGGCGA TCATATTCAA TTTCAAAAAG TATCGTGGCA AGAAGCACAA CGGTTATATA TCGAACGAGA GCAAGAAATA AAAAAATGGA AAATGATCAT TCATCAAAAA TGGAGGGAAA TCGGCTATGC GCGTTGA
|
Protein sequence | MSTIQVIHSG FFTTVQDGGR FGYQKAGVSV GGVMDSFASR IANLLVENDA NEATLEITMN GPTLRFETDA LVAICGGVFR CALNGEPISM WKPLVIRRDD VLSIGACQGG YRAYIAFAGG LNIPIVMNSR STHVQAHIGG FHGRALQPGD VLSLRSKTIT IPKNPIRWGI AFSARNYIKG KRKMIRVVKG PEYNMFTEQS LERFFSSAYE VTTQSDRMGY RLQGQALERK TNQEMISEAV TFGTVQVPAS GQPIVLMADC QPTGGYPRIA QVISVDLPIL AQARPGDHIQ FQKVSWQEAQ RLYIEREQEI KKWKMIIHQK WREIGYAR
|
| |