Gene Sde_2149 details

Gene Information

Locus tag: Sde_2149
Symbol:
ID: 3967533
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Saccharophagus degradans 2-40
Kingdom: Bacteria
Replicon accession: NC_007912
Strand:
Start bp: 2742551
End bp: 2743891
Gene length: 1341 bp
Protein length: 446 aa
Translation table: 11
GC content: 48%
IMG OID: 637921239
Product: N-ethylammeline chlorohydrolase
Protein accession: YP_527621
Protein GI: 90021794
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
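
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS length includes the terminal stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record.
start_bp = 2742551
end_bp = 2743891

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon, which encodes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1341 0 446
```

Under these assumptions the result agrees with the listed gene length of 1341 bp and protein length of 446 aa.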


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.450112
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAACA AAACACCTGC CGACCTTATT GTAAGCGCCC GCTGGATATT GCCAGTTCGC 
CCAACAGGCC GCCTATATGA GCACTGCGCA TTAGTAATAC GAGATGGCAA CATTATTGAA
ATTGTGCCAA CTAGCGGTAT AGACAGCCAA TTTGACTACC AAGAGCACAT CGATTTGCCC
AATCAGCTAC TTATGCCAGG GCTTATTAAT ATGCACGGGC ATGCAGCCAT GAGCTTGTTT
AGGGGTTTGG CAGACGACTT ACCGCTTATG GAGTGGCTGC AAGATCACAT TTGGCCTGCT
GAAGGTGAAT GGGTAGACGA ACAATTTGTG CTCGACGGTA CTCAGCTAGC CATGGCCGAA
ATGCTATTAA GTGGCACCAC TTGCTTCTCA GATATGTACT TTTACCCCGA AGCTGCTGCC
GGCGCAGCCT TTGAAGCGGG TATGCGCGCC CAAATTAACT TTCCTATACT CGACTTTCCC
ACCCAATGGG GAAGCGGCCC CGAAGATTAC ATTCACAAAG GCCTAAAACT GCACGATAAC
TATCGCTCTG TAGATCTTAT CAACATTGGC TTTGGCCCAC ATGCGCCTTA CACAGTGTCT
GATGAGCCTT TAAAGCGAAT AGCGGTTTTA GCAGAAGAGC TGCAAGCCCC TATTCAAATA
CATATGCACG AAACCGCGCA AGAAGTAAGC GACTCGATAG CTAATTTTGG GGTGCGACCA
TTGCAACGCA TTGCAGACCT AGGCCTACTC GGCCCCGCTA CCCAGCTAGT GCATATGACG
CAAATAGATG AACAAGACAT AGCACTACTT ACCACCTATT CTGCCCATGT AGTGCACTGC
CCTGAATCTA ACTTGAAATT GGCCAGCGGT TTTTGCCCTG TACATACACT GCAAGAGCAC
TGCATTAACA CGTGCTTGGG CACAGACGGC GCTGCTTCGA ACAACGACCT AAGCTTGTTT
GACGAAATGC ACACAGCAAG CTTGCTGGGC AAAGGCGTTG CACAACGCGC CGATGCACTT
AAAAGCGATA CTGCAATAGA AATGGCCACC ATTAATGCCG CCACAGCAAT GGGGCTAGAC
AATATTGTTG GCAGCTTAGA AAAAGGTAAG CGTGCAGATT TTATTGCAAT TGACTTTAGT
AATTTGCAGC AAGCCCCCAT CTATAATTTA AAGAGCCATC TCGTTAATAC CCACGTTAGC
CACCTAGTTA CCCATGTATG GGTAGATGGC AAATGCTTAG TTGCAGAACG CGAATTACAA
ACCTTGGACA CCAAAGACAT TTACAGCAAA GCCTGCGCTT GGCAAGTTAA AATTCAAGCG
CAGCGCGCTT CCGGCAAATA G
 
Protein sequence
MTNKTPADLI VSARWILPVR PTGRLYEHCA LVIRDGNIIE IVPTSGIDSQ FDYQEHIDLP 
NQLLMPGLIN MHGHAAMSLF RGLADDLPLM EWLQDHIWPA EGEWVDEQFV LDGTQLAMAE
MLLSGTTCFS DMYFYPEAAA GAAFEAGMRA QINFPILDFP TQWGSGPEDY IHKGLKLHDN
YRSVDLINIG FGPHAPYTVS DEPLKRIAVL AEELQAPIQI HMHETAQEVS DSIANFGVRP
LQRIADLGLL GPATQLVHMT QIDEQDIALL TTYSAHVVHC PESNLKLASG FCPVHTLQEH
CINTCLGTDG AASNNDLSLF DEMHTASLLG KGVAQRADAL KSDTAIEMAT INAATAMGLD
NIVGSLEKGK RADFIAIDFS NLQQAPIYNL KSHLVNTHVS HLVTHVWVDG KCLVAERELQ
TLDTKDIYSK ACAWQVKIQA QRASGK
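
The protein sequence corresponds to the gene sequence translated with translation table 11. As a hedged illustration, the sketch below translates the first and last stretches of the gene sequence using only the Python standard library. The `translate` helper and the constant names are hypothetical, chosen for this example. It handles only sense and stop codons and does not model the alternative start codons that table 11 allows, which is sufficient here because the gene starts with ATG.

```python
# NCBI genetic code layout: codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
# Amino acids for translation table 11 ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

codons = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(codons, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence in this record.
head = "ATGACCAACA AAACACCTGC CGACCTTATT"
tail = "CAGCGCGCTT CCGGCAAATA G"

print(translate(head))  # MTNKTPADLI
print(translate(tail))  # QRASGK
```

Both excerpts are in frame (the tail begins at base 1321), and the outputs match the first ten and last six residues of the protein sequence.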