Gene TM1040_3479 details


Gene Information

Locus tag: TM1040_3479
Symbol:
ID: 4075113
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 503802
End bp: 505187
Gene Length: 1386 bp
Protein Length: 461 aa
Translation table: 11
GC content: 63%
IMG OID: 638004988
Product: guanine deaminase
Protein accession: YP_611713
Protein GI: 99078455
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID: [TIGR02967] guanine deaminase
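
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end coordinates are 1-based and inclusive, and that the CDS is a single uninterrupted reading frame ending in a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 503802
end_bp = 505187

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one uninterrupted reading frame whose final codon is a stop
# codon, which contributes no amino acid to the protein.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1386 0 461
```

Under these assumptions the result agrees with the listed Gene Length (1386 bp) and Protein Length (461 aa).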


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCGTTGCT CCCAAAAGGG GCGCGACGCG GGCCTGCGCT GGGCGCTGGG GCTCTCTCCT 
AGTTTACGAA AGGCGAACCT GATGTCCGCT GATGCCAAGA TCCTGCGTGG CCGCACGCTG
ACTTTTCATG CCGAGCCCAA CGGGCCAGAT GATACCGCGG CCTATACCTG TCTCGAGGAT
GGCGCGCTGC TGATCCGCGA TGGTCGCATT GCCGCTCATG GGGCCTATGC CGAGGTTCTG
CGCGCCGCGC CCGAGGCCGA GGTGGTCGAT CACCGCCCAC ATCTGCTGAT GCCGGGCTTT
ATCGATCTGC ATCTGCACTT TCCGCAGGTA CAGGTGGTGG CCTCTTGGGG CGAGCAGCTG
CTCGATTGGC TCAACACGTA TACCTTCCCC GCCGAGGTGC AGTTTGCCGA CAAGACCCAT
GCGGACCGGA TGGCGCGTGC GTTCTTTGAT CTGGTTCTGA GCCACGGCAC CACCACCGCG
GTGGCCTTCT GTTCGGTGCA TCCCGCCTCG GCCGAGGCCT ATTTTGCCGA GGCAGCGCGC
CGCAACATGC GGATGATTGG CGGCAAGGTG ATGATGGATC GCAACGCCCC CGACGGGCTG
CGCGATACGG CCCGACAGGG CTATGACGAA ACCAAGGCTC TGATCGAGCG CTGGCACGGC
AAGGGGCGGG CCTCCTACGC GATCTCGCCG CGCTTTGCGA TCACCTCGAC CCCGGACCAG
CTGGAGATGG CAGGTGCCCT GGTGGCAGAG CACCCGGATG CCTATGTGCA GACACATCTC
TCGGAAAATC GAGACGAGAT CGATTTTACA CTGAGCCTGT ATCCGGATGC GCCAGACTAC
CTTGGCATCT ACGAGCGCTA CGGGCTGGTG CACGACAAGA CACTCCTGGG CCATGCAATC
CATCTGGAAC CGCGCGAGAT TGATCTGCTG GCGGAGGTGG GTGGCAAGCC GGTGTTCTGC
CCCACGTCCA ACCTGTTTCT CGGCAGCGGA CTCTTTGATG ACGGGGGACT GCGGGCCAAG
GGCATCCAGA ACGGGATCGC CACCGATATC GGGGCGGGCA CCAGCTATTC GATGCTGCAG
ACCCTCAATG AAGGCTACAA GATCCTTCAA CTGCAAAACC AGAAGCTTCA CCCACTGAAC
GCGTTCCACT GGATCACCCG TGGCAATGCC GAAGTCCTGG GGCAGCTGCA GGAGATCGGC
ACCTTGGATG TTGGATCCGA GGCCGACATC GTGGTGCTCG ACGTGGCTGC TACCCCGGCG
ATGGCGTTGC GGGCCGAGGC TGCCACGTCC CTGTCGGAGG AGTTGTTCAT CCTTCAGATG
CTCGGGGACG ACCGCGCCGT GGTGGAAACA TATGTGGCTG GCGAGGCGAT GAAGCCGCAA
AGCTGA
 
Protein sequence
MRCSQKGRDA GLRWALGLSP SLRKANLMSA DAKILRGRTL TFHAEPNGPD DTAAYTCLED 
GALLIRDGRI AAHGAYAEVL RAAPEAEVVD HRPHLLMPGF IDLHLHFPQV QVVASWGEQL
LDWLNTYTFP AEVQFADKTH ADRMARAFFD LVLSHGTTTA VAFCSVHPAS AEAYFAEAAR
RNMRMIGGKV MMDRNAPDGL RDTARQGYDE TKALIERWHG KGRASYAISP RFAITSTPDQ
LEMAGALVAE HPDAYVQTHL SENRDEIDFT LSLYPDAPDY LGIYERYGLV HDKTLLGHAI
HLEPREIDLL AEVGGKPVFC PTSNLFLGSG LFDDGGLRAK GIQNGIATDI GAGTSYSMLQ
TLNEGYKILQ LQNQKLHPLN AFHWITRGNA EVLGQLQEIG TLDVGSEADI VVLDVAATPA
MALRAEAATS LSEELFILQM LGDDRAVVET YVAGEAMKPQ S
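
The relationship between the gene sequence, the listed translation table (11), and the protein sequence can be illustrated with a short script. This is a sketch under stated assumptions rather than the method used to produce this record: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, the codon assignments are the standard ones that translation table 11 shares with the standard code, and an initiator codon at the start of the CDS (here TTG) is assumed to be read as methionine.

```python
# Codon table in the conventional T, C, A, G ordering. Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: initiator codons accepted for table 11; each is read as
# methionine when it is the first codon of the CDS.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq, is_cds_start=True):
    """Translate in frame 0 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")  # initiator codon read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed above.
print(translate("TTGCGTTGCT CCCAAAAGGG GCGCGACGCG"))  # prints: MRCSQKGRDA
```

The printed fragment matches the first ten residues of the protein sequence. Passing the complete gene sequence to `gc_percent` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.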