Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3052 |
Symbol | |
ID | 4075146 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | - |
Start bp | 21606 |
End bp | 22877 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 638004553 |
Product | amidohydrolase |
Protein accession | YP_611288 |
Protein GI | 99078030 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCATG CTCCTGGGGC GCAATCCCGC CTGTCTGCCT CCCGTTTGAC CGAGGCCCGT CTGCCCGGCG TCGCCGTTCC CGCCGCTCTC GTTGCCTCTG CCGACCGCTT TGGGGGCCAG CCGCAGGGCG AGCACCTGGT GGGCGACCTT GTGCTGCGAA ACGGGCGTGC TGAGCGGCTC GAGGCCGCGA CGGTGCCGCC CCGGAGATTG GTGCTGCCAA AACTCACAGA GCCGCATGTG CATCTCGACA AGTGCCACAC GATCTACCGA ATGGACGGGG TCGGCGGCGG TCTGGAAGAT GCGATTTCGG CGCAGGCTCT GGACCGGGAA ACCTGGACTG CTGATGATAT TCGCGCGCGG GCGGGGCGGG GACTGGGAGA GCTCCTGGCC GCGGGCTGTT CTGCTGTGCG TTCCCATGTG GATTGGGGCA GCGGGCGCGA CCCGGCGCAG GCCCCACTGG CCTGGGATAT CCTGAGGGAG CTGGCCCAAG ACGCCTCGGA TGCGGTGATC GTGCAGCGTG CGGCGCTGAC AGGAGCCGAC AGGATGGCCG ACATCGGCTA TGCGCGGGCT TGCGCTGCGC GCGTGGCCCA GAGCGGCGGC GTGCTTGGAT CCTTTGTGCT GAACCAGCCG GGTCGCAAAG AGGGCATCGC CAATATCTTT CGCGTTGCAG AGGATATGGG GCTGGCTCTT GATTTTCACG TCGACGAAGG CCTCGCGCGG GGGCTCGACG GCTTGGAGAT GATCGCCGAC GCCGCTCTCG CCACCCGGTT CGGCGGACCG GTTCTCTGCG GCCATGCCTG CAGCCTGATG AACCGCTCCG ATGAGGATGT GCGGCGGATT GCCGAAAAGC TCGCCCGCGC TGAAATCTCC GTGGTCGCGC TTCCGACCAC CAATCTGTAC TTGCAGGGGC GCAACAACGG CACGCCGGAC CGCCGGGGGC TGACGCGGAT TCACGAGCTT GCTGCTGCAG GCGTAAACGT GGTGCTCGGC GCGGACAATG TGCGCGATGC CTTCTGCCCG CTCGGCAGTC ACGACCCGCT GGCGACGCTT TCGCTGGCGG TGCTTGCCGG GCATCTCGAT CCGCCTTTTG GCGACCATCT ACCCATGATC ACCACCGGCG CACGCCGCGC GCTTGGCCTT GCCCCCGTGA CCGTCGACGG GGCTGCAATC GGGGATCTGC AGCTGTTCGA CGCGCTTTTG GTCACGGACA TTCTGGGCAG CCGATCTGCG CCGCGTCCCC TGACCGACGA TTTGCCAGGA GCCTCCCTAT GA
|
Protein sequence | MSHAPGAQSR LSASRLTEAR LPGVAVPAAL VASADRFGGQ PQGEHLVGDL VLRNGRAERL EAATVPPRRL VLPKLTEPHV HLDKCHTIYR MDGVGGGLED AISAQALDRE TWTADDIRAR AGRGLGELLA AGCSAVRSHV DWGSGRDPAQ APLAWDILRE LAQDASDAVI VQRAALTGAD RMADIGYARA CAARVAQSGG VLGSFVLNQP GRKEGIANIF RVAEDMGLAL DFHVDEGLAR GLDGLEMIAD AALATRFGGP VLCGHACSLM NRSDEDVRRI AEKLARAEIS VVALPTTNLY LQGRNNGTPD RRGLTRIHEL AAAGVNVVLG ADNVRDAFCP LGSHDPLATL SLAVLAGHLD PPFGDHLPMI TTGARRALGL APVTVDGAAI GDLQLFDALL VTDILGSRSA PRPLTDDLPG ASL
|
| |