Gene TM1040_3052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3052 
Symbol 
ID4075146 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008043 
Strand
Start bp21606 
End bp22877 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content67% 
IMG OID638004553 
Productamidohydrolase 
Protein accessionYP_611288 
Protein GI99078030 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCATG CTCCTGGGGC GCAATCCCGC CTGTCTGCCT CCCGTTTGAC CGAGGCCCGT 
CTGCCCGGCG TCGCCGTTCC CGCCGCTCTC GTTGCCTCTG CCGACCGCTT TGGGGGCCAG
CCGCAGGGCG AGCACCTGGT GGGCGACCTT GTGCTGCGAA ACGGGCGTGC TGAGCGGCTC
GAGGCCGCGA CGGTGCCGCC CCGGAGATTG GTGCTGCCAA AACTCACAGA GCCGCATGTG
CATCTCGACA AGTGCCACAC GATCTACCGA ATGGACGGGG TCGGCGGCGG TCTGGAAGAT
GCGATTTCGG CGCAGGCTCT GGACCGGGAA ACCTGGACTG CTGATGATAT TCGCGCGCGG
GCGGGGCGGG GACTGGGAGA GCTCCTGGCC GCGGGCTGTT CTGCTGTGCG TTCCCATGTG
GATTGGGGCA GCGGGCGCGA CCCGGCGCAG GCCCCACTGG CCTGGGATAT CCTGAGGGAG
CTGGCCCAAG ACGCCTCGGA TGCGGTGATC GTGCAGCGTG CGGCGCTGAC AGGAGCCGAC
AGGATGGCCG ACATCGGCTA TGCGCGGGCT TGCGCTGCGC GCGTGGCCCA GAGCGGCGGC
GTGCTTGGAT CCTTTGTGCT GAACCAGCCG GGTCGCAAAG AGGGCATCGC CAATATCTTT
CGCGTTGCAG AGGATATGGG GCTGGCTCTT GATTTTCACG TCGACGAAGG CCTCGCGCGG
GGGCTCGACG GCTTGGAGAT GATCGCCGAC GCCGCTCTCG CCACCCGGTT CGGCGGACCG
GTTCTCTGCG GCCATGCCTG CAGCCTGATG AACCGCTCCG ATGAGGATGT GCGGCGGATT
GCCGAAAAGC TCGCCCGCGC TGAAATCTCC GTGGTCGCGC TTCCGACCAC CAATCTGTAC
TTGCAGGGGC GCAACAACGG CACGCCGGAC CGCCGGGGGC TGACGCGGAT TCACGAGCTT
GCTGCTGCAG GCGTAAACGT GGTGCTCGGC GCGGACAATG TGCGCGATGC CTTCTGCCCG
CTCGGCAGTC ACGACCCGCT GGCGACGCTT TCGCTGGCGG TGCTTGCCGG GCATCTCGAT
CCGCCTTTTG GCGACCATCT ACCCATGATC ACCACCGGCG CACGCCGCGC GCTTGGCCTT
GCCCCCGTGA CCGTCGACGG GGCTGCAATC GGGGATCTGC AGCTGTTCGA CGCGCTTTTG
GTCACGGACA TTCTGGGCAG CCGATCTGCG CCGCGTCCCC TGACCGACGA TTTGCCAGGA
GCCTCCCTAT GA
 
Protein sequence
MSHAPGAQSR LSASRLTEAR LPGVAVPAAL VASADRFGGQ PQGEHLVGDL VLRNGRAERL 
EAATVPPRRL VLPKLTEPHV HLDKCHTIYR MDGVGGGLED AISAQALDRE TWTADDIRAR
AGRGLGELLA AGCSAVRSHV DWGSGRDPAQ APLAWDILRE LAQDASDAVI VQRAALTGAD
RMADIGYARA CAARVAQSGG VLGSFVLNQP GRKEGIANIF RVAEDMGLAL DFHVDEGLAR
GLDGLEMIAD AALATRFGGP VLCGHACSLM NRSDEDVRRI AEKLARAEIS VVALPTTNLY
LQGRNNGTPD RRGLTRIHEL AAAGVNVVLG ADNVRDAFCP LGSHDPLATL SLAVLAGHLD
PPFGDHLPMI TTGARRALGL APVTVDGAAI GDLQLFDALL VTDILGSRSA PRPLTDDLPG
ASL