Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Daro_1229 |
Symbol | |
ID | 3569413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dechloromonas aromatica RCB |
Kingdom | Bacteria |
Replicon accession | NC_007298 |
Strand | - |
Start bp | 1333443 |
End bp | 1334777 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637679696 |
Product | N-ethylammeline chlorohydrolase |
Protein accession | YP_284455 |
Protein GI | 71906868 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 3.42003e-16 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.25554 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACAA CACCTGAAGT TATCGACCTT CTGATCGAAG CACGCTGGAT TGCTGCGGTC GACCCCGACG TGGTACACAA AAACCACGCC GTCGCCGTCA ACCAAGGTCG CATTCTGGCC ATCCTGCCCG CCGGCGAAGC CCGTGCCCTG TATGCGCCGA AGAAGACCAC GGTACTTCAG GACCACATCC TGATTCCCGG CCTGATCAAC CTGCACACCC ACGCCGCGAT GAGCCTGATG CGTGGCCTGG CCGACGACCT GCCATTGATG GAGTGGCTAC AGAAACATAT CTGGCCGACA GAAGCCGCCC ATCTCTCGTC GCAGTTCGTC TACGACGGCA CCCGCCTCGC CTGCGCTGAA ATGCTCAAAG GCGGGATTAC CTGCTTCAAC GACATGTACT TTTTCCCGGA AGCTGCTGCG ACGGCCGCAT CCGAATTCGG CATGCGTGCA ATGCTCGGCA TCACCACGCT GGAGTTCCCA ACCCCCTATG CCAGTGATGC CACCGATTAC ATCAACAAGG GACTGGCCGT CCGCGAAGCG TGGCACAACA ATCCACTGAT TGACTTCTGC CTGGCACCCC ACGCCCCTTA CACCGTGTCT GACAGCACTT TTGAACGGAT TCTGACGCTC TCCGAGCAAC TGAATCTGCC GGTGCACTGC CACATTCACG AAACACAGCA GGAAATTGAC GAAAATCTTA AGCAGCACAA ACTGCGACCG CTGGCCCGCC TGCACAAACT CGGACTACTC GGCCCCAATT TCATCGGGGT CCACGCCGTC CACCTGAACG ACGACGACCT GCAACTACTC GCCGATACCG GCTGCAATAT CGCCCACTGC CCCACCTCTA ACCTCAAACT GGCCAGCGGC TTTGCTCCAG TAGCGAAAAT GAGACAATTT AGCATCAATG TCGGGCTAGG TACCGACGGC GCGGCAAGCA ACAACCGCCT CGACCTGTTT GGCGAAATGC GCCTGGCCTC CCTATTAGCC AAAGGACTAA CCGGTGATGC CAGCGCCCTG CCAGCCAGAG AAATCCTGCG CATGGCCACA CTCTATGCCG CCCAAGCACT GGGACTCGGC AATGAAGTCG GCTCGATCAC CCCGGGAAAA TCAGCAGACC TTTGTGCCGT CAGCCTGGCG GCATTGGAAA CACGCCCCTG TTTCGACCCA GTGTCACACC TGATCAATGT CTCAGGCCGA GAATCAGTTA CTCACGTCTG GGTCGCCGGA AAGTGTTGCG TAGACGACAA ATCTTTACTT CAGCACGACC AAAATGATTT GGAATCTGCG ATAGCACTAT GGCAGAATAG TTTGGAATTC CGCCAGCGGA CCTGA
|
Protein sequence | MTTTPEVIDL LIEARWIAAV DPDVVHKNHA VAVNQGRILA ILPAGEARAL YAPKKTTVLQ DHILIPGLIN LHTHAAMSLM RGLADDLPLM EWLQKHIWPT EAAHLSSQFV YDGTRLACAE MLKGGITCFN DMYFFPEAAA TAASEFGMRA MLGITTLEFP TPYASDATDY INKGLAVREA WHNNPLIDFC LAPHAPYTVS DSTFERILTL SEQLNLPVHC HIHETQQEID ENLKQHKLRP LARLHKLGLL GPNFIGVHAV HLNDDDLQLL ADTGCNIAHC PTSNLKLASG FAPVAKMRQF SINVGLGTDG AASNNRLDLF GEMRLASLLA KGLTGDASAL PAREILRMAT LYAAQALGLG NEVGSITPGK SADLCAVSLA ALETRPCFDP VSHLINVSGR ESVTHVWVAG KCCVDDKSLL QHDQNDLESA IALWQNSLEF RQRT
|
| |