Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_2689 |
Symbol | |
ID | 4077600 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2828383 |
End bp | 2829603 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638008014 |
Product | allantoate amidohydrolase |
Protein accession | YP_614683 |
Protein GI | 99082529 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | [TIGR01879] amidase, hydantoinase/carbamoylase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0667838 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTGGG GACAGGAAGC CCAGCAACGC CTGGCGAAGA TTGCGGCCTG CAGCGCAGCC GGACCCGGTG TAACCCGCCT GCCCTATACG CCCGAACATT CCGCAGCACT GGAACAGATC AGCGACTGGA TGCGGCGCGC CGGGCTCCAC CCACAGCTTG ATGCGGCCGC GACCCTGGTG GGGCGCAGCA GTGCGCCCTC CAACGAAGCC GCCGTTCTGA TCGGGTCGCA TCAGGACAGC GTGATCGAAG GCGGGCGCTA TGATGGCATC ATGGGGATTG TGATCGGCTG TCTGGCGCTT GAACGGCTGG CATCCGAAGG CACGCGACTC CCCTTTCCGG TCGAGGTACT GGCCTTTGCC GACGAGGAGG GCGTGCGCTT TCCCACCGCT CTTGTTGGCT CCCGCGCCCT TGCGGGCCGT TTCGACCCCA GCGTTCTGGA CATGCGCGAC GGCGAGGGCG TGACGCTCCG CACCGCGCTG TCGGAGTTTG GGGGCAGACC CGATAAAATC GCCTCTGAAG CCCGCAACAA GAACGCGGTG CGCGCTTATC TGGAACTTCA CATCGAGCAA GGTCCGATGC TGGAGCAGGA CAACGCAGCG GTTGGAATCG TCACTGGCAT CTGCGGTATC GAGCGCAACA GCGTTTCATT TGTCGGGGAG ACCGGCCATG CGGGCACCGT CCCGATGCAG GGGCGGCGCG ATGCGCTGGT GGCCGCGTCC GAGTTCGTGG TGAAAATCCA TGATGCCGCA CGAAACATCG ATGGGCTGCG CGCCACCATT GGCACGCTGG CGCTGAAACC GGCAGCCGTG AACGCCATCC CGCGCGAGGC GGCGCTGACG CTTGAGATCC GGGCGCTTTC GGATGCGGCG CGACAGGAAT TTGCCGGTGC CGCGCAGGTC ATCGGCACCG AAATCGCTGC CAAACGGGAT GTGAGCTTTG ACATGGCAAA AACCTACGAG CAACTCGCCG TGCCCTGCGC ATCGGGGTTG ATCGAAACGC TGGAGCTGGC CGCCCGGGAT GCCGGACAAC ACGCACCCCT GCTGCCCTCG GGGGCGACGC ATGATGCCTC AGCGATGGCG GATCTATGCG ACATTTCAAT GCTGTTCCTG CGCTGTAAAG ACGGGTTCAG TCACCGCCCG GAGGAATATA CCTCTGCCGA GGACATGGCG GCAGCGATTG ATGTTACCTG CGCTTTCCTG CGCCGTCTTG CCGCGGAGTA G
|
Protein sequence | MDWGQEAQQR LAKIAACSAA GPGVTRLPYT PEHSAALEQI SDWMRRAGLH PQLDAAATLV GRSSAPSNEA AVLIGSHQDS VIEGGRYDGI MGIVIGCLAL ERLASEGTRL PFPVEVLAFA DEEGVRFPTA LVGSRALAGR FDPSVLDMRD GEGVTLRTAL SEFGGRPDKI ASEARNKNAV RAYLELHIEQ GPMLEQDNAA VGIVTGICGI ERNSVSFVGE TGHAGTVPMQ GRRDALVAAS EFVVKIHDAA RNIDGLRATI GTLALKPAAV NAIPREAALT LEIRALSDAA RQEFAGAAQV IGTEIAAKRD VSFDMAKTYE QLAVPCASGL IETLELAARD AGQHAPLLPS GATHDASAMA DLCDISMLFL RCKDGFSHRP EEYTSAEDMA AAIDVTCAFL RRLAAE
|
| |