Gene TM1040_1905 details


Gene Information

Locus tag: TM1040_1905
Symbol:
ID: 4077402
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 2007845
End bp: 2009134
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 11
GC content: 64%
IMG OID: 638007221
Product: guanine deaminase
Protein accession: YP_613900
Protein GI: 99081746
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID: [TIGR02967] guanine deaminase
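
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
start_bp, end_bp = 2007845, 2009134

length = gene_length_bp(start_bp, end_bp)
print(length)                           # 1290
print(expected_protein_length(length))  # 429
```

Under these assumptions, both results agree with the Gene Length (1290 bp) and Protein Length (429 aa) fields.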


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACAAG ATCTCATTCT TCTGGGCCGA GTGCTGTCTT TTTGCGCCTC CCCTTTCGCG 
CCCGATGGGC CCGAGGCGGC CGTCGAGGAA CACGAAGCCA TTGCCCTGCG CGCGGGCAAG
ATCGTGGCGC TTGGCAGCCG GGCAGCGCTC AGCAAGGTGC TGCCAGAGGC AGAGATCGTC
GATCACGGGC AGAAGCTGAT CGTGCCGGGC TTTGTCGATG CCCATGTGCA TTATCCGCAG
ACCGCGATCA TCGCCAGCTG GGGCAAGCGG CTGATCGACT GGCTCAACAC CTATACCTTT
CCCGAGGAGA TGCGGTTTGG CAACCCGGCC TATGCCGCAG AGATCGCAGC GCGCTACCTG
GATCTGACCA CCGCCTGCGG CACCACGACG GTTGCGAGCT ACTGCACCAT CCATCCCGAG
AGCGTTGATG CGTTGTTCGA GGCGGCGCAG GCCCGCGGGC AGCGCGTGGT GGCCGGCAAG
ACCTGCATGG ACCGCAACGC GCCCGAAGGC CTGCGCGATA CGCCGCAATC CGCCTATGAC
GACAGCGCCG CGCTGATCCA GCGCTGGCAT GGGAGGGAGC GGCTCATCTA TGCGATTACG
CCACGGTTTT CACCAACCTC CACCCCAGAA CAACTCGAAG CGCTCGGCGC ACTCTGGACG
GCGCATCCTA CCTGCCTGAT GCAGACCCAT CTGAGCGAGC AACTGGACGA GATCGAATGG
GTCCGAACCC TCTTTCCCGA GGCGCGCGAC TACCTTGACA CCTACGAGCG CTACGGTCTT
CTGCGCGAAG GCGCGCTCTT TGGCCATGCC ATCCATCTTG AACCCCGCGA ACGCGCCCGC
CTCCTGGAGG CCCGCGCCAG CCTGATCCAC TGCCCCACCT CCAATACCTT CATCGGTTCA
GGGCTTTTTG ACATGAACGG CCTCATGCGC GAGGGCCACC GGATCGGCCT TGCCACCGAC
ACCGGCGGCG GGTCGAGTTT TTCGATGCTG CGTACCATGG CCGCCGCCTA TGAGGTCGCC
CAGCTGCGGG GCACGCCTTT GCACGCGGCA CAGCTCCTGT GGCTTGCAAC CCTCGGGTCG
GCCACGGCAC TTGGGCTTCA GGACAAGGTC GGTAATCTCG CCGTGGGACG GGAGGCCGAC
CTTGTGGTGC TTGATCTTGC CTCGACCCCG GCCATTGCCC AGCGTGCGAC ACGCGCAGAG
ACCCTCTGGG AGGCCCTCTT TCCCACGCTG ATGATGGGCG ATGACCGCGC CATTGCCGAG
GTCCGCATCA ACGGGCAGAA AGCGGTCTAG
 
Protein sequence
MKQDLILLGR VLSFCASPFA PDGPEAAVEE HEAIALRAGK IVALGSRAAL SKVLPEAEIV 
DHGQKLIVPG FVDAHVHYPQ TAIIASWGKR LIDWLNTYTF PEEMRFGNPA YAAEIAARYL
DLTTACGTTT VASYCTIHPE SVDALFEAAQ ARGQRVVAGK TCMDRNAPEG LRDTPQSAYD
DSAALIQRWH GRERLIYAIT PRFSPTSTPE QLEALGALWT AHPTCLMQTH LSEQLDEIEW
VRTLFPEARD YLDTYERYGL LREGALFGHA IHLEPRERAR LLEARASLIH CPTSNTFIGS
GLFDMNGLMR EGHRIGLATD TGGGSSFSML RTMAAAYEVA QLRGTPLHAA QLLWLATLGS
ATALGLQDKV GNLAVGREAD LVVLDLASTP AIAQRATRAE TLWEALFPTL MMGDDRAIAE
VRINGQKAV
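
The relationship between the two sequences above can be sketched in a few lines. This is a minimal illustration, not the pipeline that produced the record: it assumes the gene sequence is given on the coding strand, strips the 10-character block spacing, and translates with the standard codon assignments, which translation table 11 shares with table 1 (the two differ in which start codons are permitted). The helper names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGAAACAAG ATCTCATTCT TCTGGGCCGA"))  # MKQDLILLGR
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` gives a way to compare against the GC content field.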