Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1498 |
Symbol | |
ID | 4077054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1603637 |
End bp | 1604887 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638006811 |
Product | allantoate amidohydrolase |
Protein accession | YP_613493 |
Protein GI | 99081339 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases |
TIGRFAM ID | [TIGR01879] amidase, hydantoinase/carbamoylase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCGC TTGGACAGAA TCTGAAAATC AATGGCGATC GGCTGTGGGA CAGTCTGATG GATATGGCCA AGATTGGCCC CGGTGTGGCC GGTGGCAACA ATCGCCAGAC GCTCACCGAT GAGGACGCCG AAGGCCGCGC CCTGTTTCAG TCGTGGTGTG AAGCGGCGGG CATGACCATG GGGCTCGACT CCATGGGCAA TATGTTTGCA ACCCGCCCGG GTGAAGATCC CGAGGCATTG CCGGTCTACA TGGGATCGCA TCTCGACACC CAGCCGACGG GCGGCAAATA CGATGGCGTT CTAGGGGTGC TTGGCGGTCT TGAGGTCGTG CGGACCATGA ATGACCTCGG CATCAAGACC AAGCACCCGA TCGTGGTGAC CAACTGGACC AATGAGGAAG GCACGCGGTT TGCTCCGGCT ATGCTGGCGT CGGGCGTGTT TGCCGGCAAA CATACGCAAG ACTGGGCCTA TGGGCGCGAG GATGCTGAAG GCAAGACCTT TGGCGACGAG CTGAAGCGCA TCGGCTGGGT TGGCGATGAA GAGGTCGGCG CCCGCAAGAT GCACGCCATG TTTGAGCTGC ACATCGAGCA GGGTCCGATT CTTGAGGCTG AGAAGAAAGA CATTGGCGTG GTGACACACG GTCAGGGGCT CTGGTGGTTG CAATGTACCG TGACCGGCAA GGATGCGCAC ACTGGCTCGA CCCCGATGAA TATGCGGGTG AACGCCGGGC TCGGCATGGC GCGGATGACG GAAGCGGCGC ATCAGATCGC CATGGCGCAT CAGCCGCATG CAGTGGGCGC AGTGGGGCAT TGCGATGTCT TCCCCAACTC GCGCAATGTG ATCCCGGGCA AGGTGGTGTT CACCGTGGAT TTCCGCTCTC CCGACCTTGA AAAGCTGACA TCGATGCGCA CGCAATACGA AGCCAAAGCA AAGGAAATCG CGGCGGAGCT CGGTCTCGGT CTGGAGATCG AGCCGGTGGG GCATTTCGAC CCGGTGACCT TTGATGAGAG CTGCGTGAGT GCGGTTCGGG GCGCGGCAGA GCGTTTGGGC TATAGCCACA TGGATATCGT CTCTGGCGCG GGGCATGATG CCTGCTGGAT CAATGATGTG GCACCCACCG CGATGATCAT GTGTCCTTGC GTGGACGGTC TGAGCCATAA TGAGGCGGAA GAGATTTCGA AGGACTGGGC CGCGGCCGGC ACGGATGTGA TGCTGCATGC GGTGCTTGAG ACGGCTGAGA TCGTCGCCTG A
|
Protein sequence | MTALGQNLKI NGDRLWDSLM DMAKIGPGVA GGNNRQTLTD EDAEGRALFQ SWCEAAGMTM GLDSMGNMFA TRPGEDPEAL PVYMGSHLDT QPTGGKYDGV LGVLGGLEVV RTMNDLGIKT KHPIVVTNWT NEEGTRFAPA MLASGVFAGK HTQDWAYGRE DAEGKTFGDE LKRIGWVGDE EVGARKMHAM FELHIEQGPI LEAEKKDIGV VTHGQGLWWL QCTVTGKDAH TGSTPMNMRV NAGLGMARMT EAAHQIAMAH QPHAVGAVGH CDVFPNSRNV IPGKVVFTVD FRSPDLEKLT SMRTQYEAKA KEIAAELGLG LEIEPVGHFD PVTFDESCVS AVRGAAERLG YSHMDIVSGA GHDACWINDV APTAMIMCPC VDGLSHNEAE EISKDWAAAG TDVMLHAVLE TAEIVA
|
| |