Gene Information |
Locus tag | GWCH70_2486 |
Symbol | |
ID | 7979040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 2512149 |
End bp | 2513243 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 644799287 |
Product | aminodeoxychorismate lyase |
Protein accession | YP_002950447 |
Protein GI | 239827823 |
COG category | [R] General function prediction only |
COG ID | [COG1559] Predicted periplasmic solute-binding protein |
TIGRFAM ID | [TIGR00247] conserved hypothetical protein, YceG family |
|
|
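The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS length includes a terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
# Assumption: both are 1-based and inclusive.
START_BP = 2512149
END_BP = 2513243


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in a stop codon.

    Assumption: the stop codon is part of the CDS but is not translated.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n_bp = gene_length(START_BP, END_BP)
    print(n_bp, protein_length(n_bp))
```

Under these assumptions the result is 1095 bp and 364 aa, which agrees with the Gene Length and Protein Length fields.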
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00502838 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATAATG ACCAATTTTT TCAAAAAGAT GCACAAATTG TACGAAAGAT TGTGTTGATT GTTTGCATCG CTGTTTTTAT CGCATGCATC GCTATAGCAG GAGGAAGCTA TTTCTATATA AAATCAGCGT TACAACCCGT CGATCCGGAT GACCGCACTC CTGTTCATAT ATCGATACCG ATCGGCTCTT CTGTTAATGA TATTGCCAAT ATGTTAGAAG AAAAACAGCT AATTAAAAGC TCATTGGTGT TTCGCTATTA CGTGAAGTTG AAAAACCATG TTGGTTTTCA AGCGGGAGAA TACCGATTAA ATCGATCCAT GTCGATGGGA GATATTATTG CCGTCTTAAA AACGGGAAAA GTAACAGAAA AAAAGGGGTT AAAGCTGACT ATTCCGGAAG GGACGCAAAT TACGCAAATT GCTGCTATTA TTGCGGAGAA AACAGGGTAT AAGAAAGAAG AAGTGCTTCG ACAATTAAAT GACCGCAAAT ATATCGAAAA TCTTATTCAA AAATACCCGT CCATTTTATC CAAAGATATT TTAAATAAAA ATATACGCTA CCCACTAGAA GGTTATTTGT TTCCTGCTAC TTATTCCTTT CATGAAAAAA AGCCTTCTAT TGCGGAGATT GTGGAAACGA TGTTAAGAAA AACAGAAAAA GTATTAGCAA AGTATGAACG TGATAAGAAA GAAATGAACA TGACGACACA TCAGCTTTTA ACCATGTCTT CCTTAATTGA AGAAGAAGCG ACAGAAAAGG CAGATCGCGA AAAAATCGCG AGCGTGTTTT ACAACCGTCT CCGTATAGGC ATGCCGTTGC AGACAGACCC TACTGTCTTA TATGCGCTTG GTAAACATAA AGATCGCGTC TATTATAAAG ATTTAGAAGT GAAATCTCCT TATAATACGT ATATTCATAA AGGACTTCCT CCCGGACCGA TTGCGAACGC TGGGGAAATG TCGATTCGAG CGGCACTGAA ACCTGCGAAA ACCGATTATT TGTATTTTCT TGCCACACCA GCAGGGGATG TCATTTTTAC AAAGACATTG GAAGAACATA ATCGCGAGAA AGAAAAATAC ATTGGAAAAC AATAG
|
Protein sequence | MYNDQFFQKD AQIVRKIVLI VCIAVFIACI AIAGGSYFYI KSALQPVDPD DRTPVHISIP IGSSVNDIAN MLEEKQLIKS SLVFRYYVKL KNHVGFQAGE YRLNRSMSMG DIIAVLKTGK VTEKKGLKLT IPEGTQITQI AAIIAEKTGY KKEEVLRQLN DRKYIENLIQ KYPSILSKDI LNKNIRYPLE GYLFPATYSF HEKKPSIAEI VETMLRKTEK VLAKYERDKK EMNMTTHQLL TMSSLIEEEA TEKADREKIA SVFYNRLRIG MPLQTDPTVL YALGKHKDRV YYKDLEVKSP YNTYIHKGLP PGPIANAGEM SIRAALKPAK TDYLYFLATP AGDVIFTKTL EEHNREKEKY IGKQ
|
| |
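The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal sketch of how such a string might be cleaned up, its GC content computed, and its translation checked. It assumes the gene sequence is given in coding (5' to 3') orientation, and it uses the standard codon assignments, which translation table 11 shares with table 1 (the tables differ in permitted start codons, which this sketch does not model). The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI standard layout).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown above.
    head = "ATGTATAATG ACCAATTTTT TCAAAAAGAT"
    print(translate(head))
```

Applied to the first three blocks of the gene sequence, `translate` yields `MYNDQFFQKD`, which matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` is one way to compare against the GC content field.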