Gene Information |
Locus tag | TM1040_1905 |
Symbol | |
ID | 4077402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2007845 |
End bp | 2009134 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 638007221 |
Product | guanine deaminase |
Protein accession | YP_613900 |
Protein GI | 99081746 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | [TIGR02967] guanine deaminase |
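The coordinates and lengths listed above are consistent with one another. The following is a minimal sketch of that arithmetic. It assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the record states neither convention explicitly. The function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count of the product, assuming one untranslated stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2007845, 2009134)
print(length)                  # 1290
print(protein_length(length))  # 429
```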
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAAG ATCTCATTCT TCTGGGCCGA GTGCTGTCTT TTTGCGCCTC CCCTTTCGCG CCCGATGGGC CCGAGGCGGC CGTCGAGGAA CACGAAGCCA TTGCCCTGCG CGCGGGCAAG ATCGTGGCGC TTGGCAGCCG GGCAGCGCTC AGCAAGGTGC TGCCAGAGGC AGAGATCGTC GATCACGGGC AGAAGCTGAT CGTGCCGGGC TTTGTCGATG CCCATGTGCA TTATCCGCAG ACCGCGATCA TCGCCAGCTG GGGCAAGCGG CTGATCGACT GGCTCAACAC CTATACCTTT CCCGAGGAGA TGCGGTTTGG CAACCCGGCC TATGCCGCAG AGATCGCAGC GCGCTACCTG GATCTGACCA CCGCCTGCGG CACCACGACG GTTGCGAGCT ACTGCACCAT CCATCCCGAG AGCGTTGATG CGTTGTTCGA GGCGGCGCAG GCCCGCGGGC AGCGCGTGGT GGCCGGCAAG ACCTGCATGG ACCGCAACGC GCCCGAAGGC CTGCGCGATA CGCCGCAATC CGCCTATGAC GACAGCGCCG CGCTGATCCA GCGCTGGCAT GGGAGGGAGC GGCTCATCTA TGCGATTACG CCACGGTTTT CACCAACCTC CACCCCAGAA CAACTCGAAG CGCTCGGCGC ACTCTGGACG GCGCATCCTA CCTGCCTGAT GCAGACCCAT CTGAGCGAGC AACTGGACGA GATCGAATGG GTCCGAACCC TCTTTCCCGA GGCGCGCGAC TACCTTGACA CCTACGAGCG CTACGGTCTT CTGCGCGAAG GCGCGCTCTT TGGCCATGCC ATCCATCTTG AACCCCGCGA ACGCGCCCGC CTCCTGGAGG CCCGCGCCAG CCTGATCCAC TGCCCCACCT CCAATACCTT CATCGGTTCA GGGCTTTTTG ACATGAACGG CCTCATGCGC GAGGGCCACC GGATCGGCCT TGCCACCGAC ACCGGCGGCG GGTCGAGTTT TTCGATGCTG CGTACCATGG CCGCCGCCTA TGAGGTCGCC CAGCTGCGGG GCACGCCTTT GCACGCGGCA CAGCTCCTGT GGCTTGCAAC CCTCGGGTCG GCCACGGCAC TTGGGCTTCA GGACAAGGTC GGTAATCTCG CCGTGGGACG GGAGGCCGAC CTTGTGGTGC TTGATCTTGC CTCGACCCCG GCCATTGCCC AGCGTGCGAC ACGCGCAGAG ACCCTCTGGG AGGCCCTCTT TCCCACGCTG ATGATGGGCG ATGACCGCGC CATTGCCGAG GTCCGCATCA ACGGGCAGAA AGCGGTCTAG
|
Protein sequence | MKQDLILLGR VLSFCASPFA PDGPEAAVEE HEAIALRAGK IVALGSRAAL SKVLPEAEIV DHGQKLIVPG FVDAHVHYPQ TAIIASWGKR LIDWLNTYTF PEEMRFGNPA YAAEIAARYL DLTTACGTTT VASYCTIHPE SVDALFEAAQ ARGQRVVAGK TCMDRNAPEG LRDTPQSAYD DSAALIQRWH GRERLIYAIT PRFSPTSTPE QLEALGALWT AHPTCLMQTH LSEQLDEIEW VRTLFPEARD YLDTYERYGL LREGALFGHA IHLEPRERAR LLEARASLIH CPTSNTFIGS GLFDMNGLMR EGHRIGLATD TGGGSSFSML RTMAAAYEVA QLRGTPLHAA QLLWLATLGS ATALGLQDKV GNLAVGREAD LVVLDLASTP AIAQRATRAE TLWEALFPTL MMGDDRAIAE VRINGQKAV
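The sequences above are printed in space-separated blocks of 10 residues, so the spaces need to be removed before any analysis. The sketch below shows one way to do that and to run two simple checks on a nucleotide sequence: GC fraction and a basic coding-sequence sanity test. The function names are hypothetical. The start-codon check accepts only ATG, which is a simplification, since translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text):
    """Strip the whitespace used to group residues into blocks of 10."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_cds(seq):
    """Basic check: ATG start, length a multiple of 3, terminal stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# First two blocks of the gene sequence in this record.
fragment = clean_sequence("ATGAAACAAG ATCTCATTCT")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.3

# Toy sequence for illustration only; it is not taken from the record.
print(looks_like_cds(clean_sequence("ATG AAA TAG")))  # True
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` gives a value that can be compared with the GC content field of the record.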