Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1906 |
Symbol | |
ID | 4077403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 2009233 |
End bp | 2010573 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638007222 |
Product | hydroxydechloroatrazine ethylaminohydrolase |
Protein accession | YP_613901 |
Protein GI | 99081747 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTGACT TTCTTTTGAA AAATGCCCAA ACCGTCTTGA CCATGGACGA CGACCGACGC GTGCTTCACG CGGTCGACAT CCGGGTACGC GCGGGGGTGA TTGCCGAGAT CGGCCCCACG CTCGGGGGCG CTGAAACCAA GGTGGATGTG AGCGGTGCGG TGGTGACACC GGGGCTCGTG AATACGCATC ACCATCTCTA TCAGAACCTC ACGCGGGCGG TGCCGGGGGG GCAGGATGCG CTACTCTTTG GTTGGCTGCA GACGCTCTAT CCGATCTGGG CGCGCATGGG GCCGGAGCAT CTTGAGGTCT CCACCCAGCT TGGTCTGGCG GAGCTTGCGC TTTCGGGGTG TAGCCTTACA TCGGATCATC TCTATCTCTT TCCCAATGGG GGCCGGCTCG AGGATACCAT TCACGCGGCG GCGGAGGTGG GCTTGCGGTT TCATCCCACC CGCGGCGCCA TGAGCATCGG CGAAAGCGAT GGCGGTTTGC CGCCCGACAG TCTGGTGGAG CGCGAGGCGG ACATTCTCGC GGACATGATC CGGTTGGTTG ACGCCTATCA CGACCCGAGC GATGGGGCGA TGTGCCGGGT CGGACTTGCG CCCTGTTCGC CCTTTTCGGT GAGCCGGGAA CTGATGCGCG ACACGGCGCT TCTGGCACGG GACAAGGGGG TGATGCTGCA TACGCATCTG GCCGAGAATG ACGAGGACAT CGCCTATAGC GAGGCCCAGT TTGGCTGTCG CCCCGGACAA TATGCGGAGG ATCTCGGCTG GACCGGCGAT GACGTCTGGC ACGCGCATTG CGTGAAGCTG GACGTGGAAG AGATTGACCT CTTTGCCAAA ACCCGTACCG GCGTGGCGCA TTGTCCCTGT TCCAACTGTC GCCTCGGTAG CGGCATCGCA CCCGTGCGCC AGATGCGCGA TGCGGGCGTC AAGGTGGGGC TCGGCGTCGA TGGCTCGGCC AGCAATGACA TGGCCAGCCT CTGGGATGAA GCCCGTCAGG CACTGCTGCT CCAGCGGGTT GCCAATGGCG CCGACGCCAT GTCCGCCTAT GAGGCGCTGG AGATCGCGAC ACGCGGCGGG GCCGACGTAC TGGGGCGGCC GGACTGCGGC CGGATTGCGG TCGGAAAACG CGCCGATATC GCGGTCTGGG ATGTCTCCGG GCTGGCGTCC AGCGGCAGCT GGGATCCAGC GGCGCTGGTT CTGGCCGGTC CGCGCCAGGT GCGGGATCTC TTTGTCGAGG GGCGCCAGGT GGTGGCCTCT GGTCGGTTGA CCACGGTTGA TACGGCGGCG GTGATCCGCC GTCACGGTGC CTTGGCGCAG GCCTTGGCGA ACGGAGACTA A
|
Protein sequence | MTDFLLKNAQ TVLTMDDDRR VLHAVDIRVR AGVIAEIGPT LGGAETKVDV SGAVVTPGLV NTHHHLYQNL TRAVPGGQDA LLFGWLQTLY PIWARMGPEH LEVSTQLGLA ELALSGCSLT SDHLYLFPNG GRLEDTIHAA AEVGLRFHPT RGAMSIGESD GGLPPDSLVE READILADMI RLVDAYHDPS DGAMCRVGLA PCSPFSVSRE LMRDTALLAR DKGVMLHTHL AENDEDIAYS EAQFGCRPGQ YAEDLGWTGD DVWHAHCVKL DVEEIDLFAK TRTGVAHCPC SNCRLGSGIA PVRQMRDAGV KVGLGVDGSA SNDMASLWDE ARQALLLQRV ANGADAMSAY EALEIATRGG ADVLGRPDCG RIAVGKRADI AVWDVSGLAS SGSWDPAALV LAGPRQVRDL FVEGRQVVAS GRLTTVDTAA VIRRHGALAQ ALANGD
|
| |