Gene Information |
Locus tag | GWCH70_1338 |
Symbol | |
ID | 7978141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1405267 |
End bp | 1406571 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 644798275 |
Product | Acetamidase/Formamidase |
Protein accession | YP_002949448 |
Protein GI | 239826824 |
COG category | [C] Energy production and conversion |
COG ID | [COG2421] Predicted acetamidase/formamidase |
TIGRFAM ID | |
|
|
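The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable and function names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
START_BP = 1405267
END_BP = 1406571
GENE_LENGTH_BP = 1305
PROTEIN_LENGTH_AA = 434


def cds_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for a CDS whose final codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


span = cds_span(START_BP, END_BP)
residues = expected_protein_length(GENE_LENGTH_BP)

print(span)      # 1305
print(residues)  # 434
```

Both values agree with the Gene Length and Protein Length fields, so the record is internally consistent under these assumptions.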
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.702518 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGCAA AACAAACCGT ATTTGTCAAC GAGTTCACGA ACGGAATTTT AGATCCTAAC GGTGAAATGC TAGGTCCTGT GCAAGATGGC GGATATATTG TGGCCAATAC GGCCCCTGGC TGCTGGGGAC CGATGATCAC CCCTTGTATT CGCGGCGGCC ATGAAGTCAC GAAACCTGTT TTCGTAGAAG GCGCAGAAGT GGGAGACGCG ATCGCGATCA AAATTAAATC GATCCGCGTC ACTTCCATTG CTACGTCCTC CGGAAATGAT AAGCCGATGG AAGGAAGATT TGTAGGAGAC CCATTTGTCG CCGTGAAATG TCCTGAATGC GGAACGATGT ATCCAGAGAC AAAAATAGAA GGAATAGGCC AAGAAGCAAT TCGTTGCGCC AATTGCGGCG CGGACGTCAC TCCTTTTGTG TTTACGAACG GTTATACGAT GGCGTTTGAT TCCAATAAAA AAGTCGGCAT CACGCTGCAT AAAGAAGCGG CAGAACATAT CGCCCAACAA GGGCGCTATT ACATGGCGAC CCCTGACAAC TCGGTGCAAA ACCCGATCGT CACTTTTGCA CCTCACGATT TAGTAGGAAC AGTCACTAGA CTTCGACCTT TCCTTGGACA ACTTGGCACA ACGCCGGCAC GTCCATTTCC AGACTCGCAC AACGCGGGGG ACTTCGGGCA GTTTCTCGTT GATGCCCCGC ATGAATACGG GATCACAAAA GAGCAGCTCG AAGACCGCAC AGACGGACAT ATGGATATTA ATCGCGTGCG GGAAGGTGCC GTCTTAATCT GCCCTGTGAA AGTGCCGGGC GGCGGCGTTT ATCTCGGGGA TATGCACGCC ATGCAAGGAG ATGGGGAAAT CGCGGGACAT ACAACAGATG TAGCTGGTAT CGTCACGCTT CAAGTAAAAG TGATTAAAGG TTTAACGATT GAAGGTCCGA TCCTTCTCCC TGTCGAAGAA GACCTTCCAT ATCTAGCAAA GCCAATAACG AAAAAAGAAA AAGAAATTGC GCTTGACCTC GCACAAACAT GGGGCGTGAA AAAGCTAGAA GAATCCCTTC CGATTTCCTT TATCGGAACA GGCTCAAACT TGAATGAAGC GACGGAAAAC GGATTACAAA GAGCGGCGAA TGTGTTAGGC ATTTCCGTCC CTGAAGTGAT GAACCGCGCA ACGATCACCG GCGCGATCGA AATCGGCAGA CACCCAGGAG TCGTCACTGT CACATTCCTA TGTCCTGTCC GTTATTTAGA CAATATTGGC CTTACGACAT TAATCCGAGA TCAATATCGC GATAGTTTGG AATAA
|
Protein sequence | MEAKQTVFVN EFTNGILDPN GEMLGPVQDG GYIVANTAPG CWGPMITPCI RGGHEVTKPV FVEGAEVGDA IAIKIKSIRV TSIATSSGND KPMEGRFVGD PFVAVKCPEC GTMYPETKIE GIGQEAIRCA NCGADVTPFV FTNGYTMAFD SNKKVGITLH KEAAEHIAQQ GRYYMATPDN SVQNPIVTFA PHDLVGTVTR LRPFLGQLGT TPARPFPDSH NAGDFGQFLV DAPHEYGITK EQLEDRTDGH MDINRVREGA VLICPVKVPG GGVYLGDMHA MQGDGEIAGH TTDVAGIVTL QVKVIKGLTI EGPILLPVEE DLPYLAKPIT KKEKEIALDL AQTWGVKKLE ESLPISFIGT GSNLNEATEN GLQRAANVLG ISVPEVMNRA TITGAIEIGR HPGVVTVTFL CPVRYLDNIG LTTLIRDQYR DSLE
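The relationship between the gene sequence and the protein sequence above can be spot-checked by translating a few codons. The sketch below is illustrative only. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA, even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for sense codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and written only for this example.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last two blocks of the gene sequence, copied from the record.
first_blocks = "ATGGAAGCAA AACAAACCGT ATTTGTCAAC"
last_blocks = "GATAGTTTGG AATAA"

print(translate(first_blocks))  # MEAKQTVFVN
print(translate(last_blocks))   # DSLE
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field; that result is not asserted here.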