Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | DET0840 |
Symbol | purB |
ID | 3229872 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dehalococcoides ethenogenes 195 |
Kingdom | Bacteria |
Replicon accession | NC_002936 |
Strand | + |
Start bp | 769007 |
End bp | 770362 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637120404 |
Product | adenylosuccinate lyase |
Protein accession | YP_181567 |
Protein GI | 57234405 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0015] Adenylosuccinate lyase |
TIGRFAM ID | [TIGR00928] adenylosuccinate lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGAGC GATACAGCCG CCCTCAAATG AAAAAAGTCT GGTCAGACGA AAGCAAATTC GGTTACTGGC TGGATATTGA AATTGCAGTC TGCGAAGCCT GGGCTAAAAT TGGCGTGATT TCCCGTGAGG ATATTACCAA AATCAAGCTG GCGCGTTTGA ATTTCAAACG TATGGAAGAA CTACTAAAAG AAACCCACCA CGATATGACT GCCTTTCTGG GCTCGGTGGC CGAAAGCTTG GGTGATGAAT CCCGCTTTAT CCATATGGGC ATGACTTCTT CAGACGTTAT GGACACCGCC CTCAGCCTTC AGCTGGTCGA AGCCTCCAAG ATACTTAACA GCGGCATTAA AGAGCTGATA AATGCCCTGG CTGCCAAAGC TATGGAGTAT AAATATACTG TTCAGGTAGG GCGTACCCAC GGCGTGCACG CCGAACCCAT TTCATTCGGG CTGAAACTGG CACTCTGGAT GGAAGAAATG AAGCGTAACC GCCAGCGCCT TGCGGACGCC ACCAAAGCTA TTACGGTGGG CAAAATGTCA GGTGCGGTAG GCACATATGC TACTTTATCA CCCGAAATTG AAGAAATAGC CTGTAAAAAA CTGGGGCTTT CCCCGGCTTC CATTTCCAAT CAGGTAATCC AGCGTGACCG TCATGCTCAG TACATGACTA CTCTGGCCAT TATCGCCGGT TCGCTGGAGA AATTTGCTAC CGAAATACGG GCTCTTCAGA AGACTGAATG CCACGAGGCT GAAGAACCAT TTGAAAAAGG GCAAACCGGT TCGTCAGCTA TGCCTCATAA GAAAAATCCT GAGCTTTGCG AACGAATTTG CGGTATTGCC CGTATAATAC GCGGTTATTC CGTTACCGCC ATGGAAAATC AGCCCTTGTG GCATGAGCGG GATATCAGCC ACTCCTCTAC CGAACGGGTA ATAATGCCTG ACGGCTGTCT GCTGCTGGAT TACGCTTTGC ACATTTTTAC CAATGTTATA AAGGGTCTAA ATGTTTTCCC TGAACAGATG GAAAAGAATC TTAACCTTAC CGGCGGTCTG GTTTATTCAC AGAGAGTTAT GCTTGCCCTG ATAAACAAGG GGCTTTCACG CCAGCAGGCA TACAAGATGG TGCAGCGGAA TGCTATGCGC ACCTGGCAAG GCGAAGATAA TTTTATGAAC CTGCTCAAGG CGGATACCGA GGTTATGGAA CACCTTTCTT CTGCCGAAGT TGACGAATTA TTTGACTACA AATTTTATCT GCGCTACATA GATGATATAT TCAGACGGGT AGGGCTGACT AATTCCCAGT GGAAGAAAGG CGGGGATGCC TCCTCTGACG AAGGACTGGC CCCCAGAGCT ATATAA
|
Protein sequence | MIERYSRPQM KKVWSDESKF GYWLDIEIAV CEAWAKIGVI SREDITKIKL ARLNFKRMEE LLKETHHDMT AFLGSVAESL GDESRFIHMG MTSSDVMDTA LSLQLVEASK ILNSGIKELI NALAAKAMEY KYTVQVGRTH GVHAEPISFG LKLALWMEEM KRNRQRLADA TKAITVGKMS GAVGTYATLS PEIEEIACKK LGLSPASISN QVIQRDRHAQ YMTTLAIIAG SLEKFATEIR ALQKTECHEA EEPFEKGQTG SSAMPHKKNP ELCERICGIA RIIRGYSVTA MENQPLWHER DISHSSTERV IMPDGCLLLD YALHIFTNVI KGLNVFPEQM EKNLNLTGGL VYSQRVMLAL INKGLSRQQA YKMVQRNAMR TWQGEDNFMN LLKADTEVME HLSSAEVDEL FDYKFYLRYI DDIFRRVGLT NSQWKKGGDA SSDEGLAPRA I
|
| |