Gene TM1040_1906 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1906 
Symbol 
ID4077403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2009233 
End bp2010573 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content65% 
IMG OID638007222 
Producthydroxydechloroatrazine ethylaminohydrolase 
Protein accessionYP_613901 
Protein GI99081747 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGACT TTCTTTTGAA AAATGCCCAA ACCGTCTTGA CCATGGACGA CGACCGACGC 
GTGCTTCACG CGGTCGACAT CCGGGTACGC GCGGGGGTGA TTGCCGAGAT CGGCCCCACG
CTCGGGGGCG CTGAAACCAA GGTGGATGTG AGCGGTGCGG TGGTGACACC GGGGCTCGTG
AATACGCATC ACCATCTCTA TCAGAACCTC ACGCGGGCGG TGCCGGGGGG GCAGGATGCG
CTACTCTTTG GTTGGCTGCA GACGCTCTAT CCGATCTGGG CGCGCATGGG GCCGGAGCAT
CTTGAGGTCT CCACCCAGCT TGGTCTGGCG GAGCTTGCGC TTTCGGGGTG TAGCCTTACA
TCGGATCATC TCTATCTCTT TCCCAATGGG GGCCGGCTCG AGGATACCAT TCACGCGGCG
GCGGAGGTGG GCTTGCGGTT TCATCCCACC CGCGGCGCCA TGAGCATCGG CGAAAGCGAT
GGCGGTTTGC CGCCCGACAG TCTGGTGGAG CGCGAGGCGG ACATTCTCGC GGACATGATC
CGGTTGGTTG ACGCCTATCA CGACCCGAGC GATGGGGCGA TGTGCCGGGT CGGACTTGCG
CCCTGTTCGC CCTTTTCGGT GAGCCGGGAA CTGATGCGCG ACACGGCGCT TCTGGCACGG
GACAAGGGGG TGATGCTGCA TACGCATCTG GCCGAGAATG ACGAGGACAT CGCCTATAGC
GAGGCCCAGT TTGGCTGTCG CCCCGGACAA TATGCGGAGG ATCTCGGCTG GACCGGCGAT
GACGTCTGGC ACGCGCATTG CGTGAAGCTG GACGTGGAAG AGATTGACCT CTTTGCCAAA
ACCCGTACCG GCGTGGCGCA TTGTCCCTGT TCCAACTGTC GCCTCGGTAG CGGCATCGCA
CCCGTGCGCC AGATGCGCGA TGCGGGCGTC AAGGTGGGGC TCGGCGTCGA TGGCTCGGCC
AGCAATGACA TGGCCAGCCT CTGGGATGAA GCCCGTCAGG CACTGCTGCT CCAGCGGGTT
GCCAATGGCG CCGACGCCAT GTCCGCCTAT GAGGCGCTGG AGATCGCGAC ACGCGGCGGG
GCCGACGTAC TGGGGCGGCC GGACTGCGGC CGGATTGCGG TCGGAAAACG CGCCGATATC
GCGGTCTGGG ATGTCTCCGG GCTGGCGTCC AGCGGCAGCT GGGATCCAGC GGCGCTGGTT
CTGGCCGGTC CGCGCCAGGT GCGGGATCTC TTTGTCGAGG GGCGCCAGGT GGTGGCCTCT
GGTCGGTTGA CCACGGTTGA TACGGCGGCG GTGATCCGCC GTCACGGTGC CTTGGCGCAG
GCCTTGGCGA ACGGAGACTA A
 
Protein sequence
MTDFLLKNAQ TVLTMDDDRR VLHAVDIRVR AGVIAEIGPT LGGAETKVDV SGAVVTPGLV 
NTHHHLYQNL TRAVPGGQDA LLFGWLQTLY PIWARMGPEH LEVSTQLGLA ELALSGCSLT
SDHLYLFPNG GRLEDTIHAA AEVGLRFHPT RGAMSIGESD GGLPPDSLVE READILADMI
RLVDAYHDPS DGAMCRVGLA PCSPFSVSRE LMRDTALLAR DKGVMLHTHL AENDEDIAYS
EAQFGCRPGQ YAEDLGWTGD DVWHAHCVKL DVEEIDLFAK TRTGVAHCPC SNCRLGSGIA
PVRQMRDAGV KVGLGVDGSA SNDMASLWDE ARQALLLQRV ANGADAMSAY EALEIATRGG
ADVLGRPDCG RIAVGKRADI AVWDVSGLAS SGSWDPAALV LAGPRQVRDL FVEGRQVVAS
GRLTTVDTAA VIRRHGALAQ ALANGD