Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sde_2149 |
Symbol | |
ID | 3967533 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | + |
Start bp | 2742551 |
End bp | 2743891 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637921239 |
Product | N-ethylammeline chlorohydrolase |
Protein accession | YP_527621 |
Protein GI | 90021794 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.450112 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAACA AAACACCTGC CGACCTTATT GTAAGCGCCC GCTGGATATT GCCAGTTCGC CCAACAGGCC GCCTATATGA GCACTGCGCA TTAGTAATAC GAGATGGCAA CATTATTGAA ATTGTGCCAA CTAGCGGTAT AGACAGCCAA TTTGACTACC AAGAGCACAT CGATTTGCCC AATCAGCTAC TTATGCCAGG GCTTATTAAT ATGCACGGGC ATGCAGCCAT GAGCTTGTTT AGGGGTTTGG CAGACGACTT ACCGCTTATG GAGTGGCTGC AAGATCACAT TTGGCCTGCT GAAGGTGAAT GGGTAGACGA ACAATTTGTG CTCGACGGTA CTCAGCTAGC CATGGCCGAA ATGCTATTAA GTGGCACCAC TTGCTTCTCA GATATGTACT TTTACCCCGA AGCTGCTGCC GGCGCAGCCT TTGAAGCGGG TATGCGCGCC CAAATTAACT TTCCTATACT CGACTTTCCC ACCCAATGGG GAAGCGGCCC CGAAGATTAC ATTCACAAAG GCCTAAAACT GCACGATAAC TATCGCTCTG TAGATCTTAT CAACATTGGC TTTGGCCCAC ATGCGCCTTA CACAGTGTCT GATGAGCCTT TAAAGCGAAT AGCGGTTTTA GCAGAAGAGC TGCAAGCCCC TATTCAAATA CATATGCACG AAACCGCGCA AGAAGTAAGC GACTCGATAG CTAATTTTGG GGTGCGACCA TTGCAACGCA TTGCAGACCT AGGCCTACTC GGCCCCGCTA CCCAGCTAGT GCATATGACG CAAATAGATG AACAAGACAT AGCACTACTT ACCACCTATT CTGCCCATGT AGTGCACTGC CCTGAATCTA ACTTGAAATT GGCCAGCGGT TTTTGCCCTG TACATACACT GCAAGAGCAC TGCATTAACA CGTGCTTGGG CACAGACGGC GCTGCTTCGA ACAACGACCT AAGCTTGTTT GACGAAATGC ACACAGCAAG CTTGCTGGGC AAAGGCGTTG CACAACGCGC CGATGCACTT AAAAGCGATA CTGCAATAGA AATGGCCACC ATTAATGCCG CCACAGCAAT GGGGCTAGAC AATATTGTTG GCAGCTTAGA AAAAGGTAAG CGTGCAGATT TTATTGCAAT TGACTTTAGT AATTTGCAGC AAGCCCCCAT CTATAATTTA AAGAGCCATC TCGTTAATAC CCACGTTAGC CACCTAGTTA CCCATGTATG GGTAGATGGC AAATGCTTAG TTGCAGAACG CGAATTACAA ACCTTGGACA CCAAAGACAT TTACAGCAAA GCCTGCGCTT GGCAAGTTAA AATTCAAGCG CAGCGCGCTT CCGGCAAATA G
|
Protein sequence | MTNKTPADLI VSARWILPVR PTGRLYEHCA LVIRDGNIIE IVPTSGIDSQ FDYQEHIDLP NQLLMPGLIN MHGHAAMSLF RGLADDLPLM EWLQDHIWPA EGEWVDEQFV LDGTQLAMAE MLLSGTTCFS DMYFYPEAAA GAAFEAGMRA QINFPILDFP TQWGSGPEDY IHKGLKLHDN YRSVDLINIG FGPHAPYTVS DEPLKRIAVL AEELQAPIQI HMHETAQEVS DSIANFGVRP LQRIADLGLL GPATQLVHMT QIDEQDIALL TTYSAHVVHC PESNLKLASG FCPVHTLQEH CINTCLGTDG AASNNDLSLF DEMHTASLLG KGVAQRADAL KSDTAIEMAT INAATAMGLD NIVGSLEKGK RADFIAIDFS NLQQAPIYNL KSHLVNTHVS HLVTHVWVDG KCLVAERELQ TLDTKDIYSK ACAWQVKIQA QRASGK
|
| |