Gene Information |
Locus tag | TM1040_3256 |
Symbol | |
ID | 4075398 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 257295 |
End bp | 258170 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 638004765 |
Product | allophanate hydrolase subunit 1 |
Protein accession | YP_611492 |
Protein GI | 99078234 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2049] Allophanate hydrolase subunit 1 |
TIGRFAM ID | [TIGR00370] conserved hypothetical protein TIGR00370 |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.350388 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.431322 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGACGC GCTATACCTA CGGAGGGGAC GAACATGTCT ACGTCGAGAT CGACGATGAG ATGTCTCTCG ACGCCTTCTT TATCGCGCTC TCGCTGTCCA AACTGGTGCG CGAGGCCAAT ATTCCGGGCG TCACCGAAAT CTGCCCGGCC AATGCTTCGT TTGAGGTGCG CTTTGATCCC GATGTGATCG CGCCGGACGA GATGATGGCC CGCATTCGGG AACTCGAAGG CGCAGCCAAA GCGCAAGAGA AACGTCTCGA CACACGGATT ATCGAGATCC CGGTGTTTTA CGAAGACCCC TGGACGCATG AGACCCTGAT GCGGTTTCGC GAGCGGCATC AGGACCCCTC TGGCACCGAT CTGGATTATG CGGCGCGGAT CAACGGGTAT GACGATGCTG CGGCCTTCAT CGAGGCGCAT CATTCCCAAC CGTGGTTCGT GTCTATGGTA GGTTTCGTCT CAGGTCTGCC CTTCCTCTAT CAGCTGGTCG AGCGCCAGAA ACAGCTGGAG GTACCCAAAT ACCTGCGCCC CCGCACCGAT ACGCCCAAGC TCACGGTCGG CTATGGGGGC TGTTTTTCCT GCATTTACTC CGTGCGTGGG GCAGGCGGTT ACCAGATGTT CGGGATCACC CCGATGCCGA TCTTCGATCC CGAGCAGAAG ATCAGCTATC TCAAGGACTT CATGGTTTTT TTCCGCCCCG GTGACATCGT CAAATGGAAG CCGATCGACC GCGCAGAGTA TGACGCGATT ACCGCGCAGG TCGAAGCAGG CAGCTATGTG CCAAAGATCG CAGAGGTCGC CTTTGATCTC GATGCCTTCA ACAAGGACAT GACCGGTTTC AACGCCAAAC TGATGGAGGC CCTCAATGAC GCTTGA
|
Protein sequence | MKTRYTYGGD EHVYVEIDDE MSLDAFFIAL SLSKLVREAN IPGVTEICPA NASFEVRFDP DVIAPDEMMA RIRELEGAAK AQEKRLDTRI IEIPVFYEDP WTHETLMRFR ERHQDPSGTD LDYAARINGY DDAAAFIEAH HSQPWFVSMV GFVSGLPFLY QLVERQKQLE VPKYLRPRTD TPKLTVGYGG CFSCIYSVRG AGGYQMFGIT PMPIFDPEQK ISYLKDFMVF FRPGDIVKWK PIDRAEYDAI TAQVEAGSYV PKIAEVAFDL DAFNKDMTGF NAKLMEALND A
|
| |
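The length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the database itself. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (`clean_sequence`, `gene_length`, `protein_length`, `gc_fraction`) are hypothetical helpers introduced only for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Strip the whitespace that splits a sequence into 10-character blocks."""
    return "".join(raw.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of this record.
    length = gene_length(257295, 258170)
    print(length, protein_length(length))  # prints "876 291"
```

Under these assumptions the coordinates reproduce the listed Gene Length (876 bp) and Protein Length (291 aa). Passing the Gene sequence field to `gc_fraction` gives a value that can be compared with the listed GC content.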