Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcer98_0436 |
Symbol | |
ID | 5344922 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cytotoxicus NVH 391-98 |
Kingdom | Bacteria |
Replicon accession | NC_009674 |
Strand | - |
Start bp | 492634 |
End bp | 493764 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640838016 |
Product | amidohydrolase |
Protein accession | YP_001373786 |
Protein GI | 152974269 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00908156 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAC TATTCAAGCA AGCCATTGTA TACCCAATTA CATCTCCAAA GTTGCAAGGG GATGTATTAG TCGTAGAAGA CAAAATCGGC GAAGTGAAAC CATATATTGA GGCAACCGAA GATATGACCG TCATCGATGC AAGAGCACTT CACCTTTTAC CCGGATTTAT CGACGTTCAT ACTCATCTTG GATTATATGA CGAAGGAACT GGTTGGGCTG GAAATGATGC CAATGAAACC TCCGAAGTAT CAACTCCACA TATCCGTTCT TTAGATGGCA TACATCCCTT TGATATCGCA TTTCACGATG CTGTAAAGGG CGGCATTACA ACCGTTCATG TCATGCCTGG GAGTCAAAAT ATTATTGGTG GCACCACATG CGTGATTAAA ACGGCTGGCA CTTGCATTGA TCATATGGTT ATACAAGAGC CAGCAGGATT AAAAATTGCT TTTGGGGAAA ATCCAAAGCG TATTCATAGT AATGGTACAA AAGAATCCAT TACCCGCATG GGTATTATGG GATTGCTGCG CGAATCTTTC TATGAAGCAC AACATTATGA CAATGATGCT GATTTTCGGA TGCTCCCTAT TTTAAAAGCA TTGCGCCGGG AAATCCCCGT ACGCATTCAT GCACATCGCG CGGATGATAT TCGTTCTGCT CTTCGTTTTG CGAATGAATT TCACCTCGAC TTGCGCATCG AACATTGCAC AGAAGGACAT TTCATTGCTG ATGAACTTGG ACAACATCAT TTAAAAGTTT CTGTCGGCCC AACACTTACA CGTCGTTCAA AAATTGAACT CAAAAATAAA TCTTGGGACA CATATCACAT TCTCGCTCAA AAAGGAGTAG AGGTTTCCAT CACAACTGAT CACCCTTATA CCCCCATTCA ATATTTGAAT ATTTGCGCAG CCCTTGCTGT TAGAGAAGGG TTAGATGAGA AAACAGCCTT AGAAGGCATC ACGATTTTAC CAGCACGAAA CTTACGTTTA GAAAATAAAA TTGGGAGCAT TGAGTCTGGA AAAGATGCCG ATCTTGTTTT ATGGACACAC CATCCATTTC ATTATTTAGC TAAACCTGTT CTGACTATGA TTGATGGGAA AATTATTTAC AAAAAAAATA AAAAAAACTA G
|
Protein sequence | MKILFKQAIV YPITSPKLQG DVLVVEDKIG EVKPYIEATE DMTVIDARAL HLLPGFIDVH THLGLYDEGT GWAGNDANET SEVSTPHIRS LDGIHPFDIA FHDAVKGGIT TVHVMPGSQN IIGGTTCVIK TAGTCIDHMV IQEPAGLKIA FGENPKRIHS NGTKESITRM GIMGLLRESF YEAQHYDNDA DFRMLPILKA LRREIPVRIH AHRADDIRSA LRFANEFHLD LRIEHCTEGH FIADELGQHH LKVSVGPTLT RRSKIELKNK SWDTYHILAQ KGVEVSITTD HPYTPIQYLN ICAALAVREG LDEKTALEGI TILPARNLRL ENKIGSIESG KDADLVLWTH HPFHYLAKPV LTMIDGKIIY KKNKKN
|
| |