Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_1476 |
Symbol | |
ID | 4077773 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 1576197 |
End bp | 1577243 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 638006787 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_613471 |
Protein GI | 99081317 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.186748 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGCG GCAAGAACGG GCTGACCTAT GCGGATGCAG GGGTCGATAT TGATGCGGGC AACGAACTTG TGGACCGGAT CAAACCGGCC GCTAAGCGAA CCAACCGCCC CGGCGTGATG AGCGGCCTTG GCGGCTTTGG CGCGCTGTTT GACCTGAAGG CTGCTGGCTA CGAGGATCCA ATCCTTGTAG GCGCGACCGA TGGGGTGGGC ACTAAGCTGC GGATTGCCAT TGATACGGGT CTCGTGGACG GCGTCGGCAT CGATCTTGTG GCGATGTGTG TCAACGATCT GGTTTGTCAG GGCGCAGAGC CGCTGTTCTT TCTCGACTAT TTTGCCACCG GCAAGCTGGA AACCGATGTC GCGGCGCGAA TCATCGAAGG CATCGCCGAA GGCTGCGTGC GCTCTGGCTG TGCGCTGATC GGCGGCGAGA CCGCCGAGAT GCCCGGCATG TACCCCAAAG GTGATTTTGA CCTCGCCGGG TTTGCCGTGG GCGCCATGGA GCGTGGCACA GCCCTGCCGG CAGGTGTCAG CGAAGGCGAT GTCCTCTTGG GGCTGGCCTC CGATGGCGTG CATTCCAACG GCTACTCTCT GGTGCGGCAG ATCGTGAAAT ACTCCGGTCT GGGCTGGGAT GGCGACAACC CGTTTGGCGA GGGCAAACTG GGCGAGGCGC TGCTGACCCC CACGCGTCTC TATGTCAAAC AATCCCTTGC CGCAGTGCGG GCCGGGGGCG TCAATGCACT TGCGCATATC ACAGGCGGCG GCCTCACGGA GAACCTGCCG CGGGTTCTGC CCGACGACCT TGGTGCGGAC ATCGACCTCG GCGCCTGGGA GCTTCCGGGT GTGTTCAAGT GGATGGCGCA AACCGGTGGC ATTGAAGAGA GCGAAATGCT CAAGACCTTC AACTGCGGGA TTGGCATGAT CCTCGTTGTG AAAGCCGACC GGGCCGATGC GCTCACCGAG GTGCTCGAGG GTGAGGGCGA GACCGTTGCG CGGCTTGGCA CTGTGACGCG CGGGGAGGGT ATTCGCTACA CGGGCGCGCT GCTGTGA
|
Protein sequence | MTSGKNGLTY ADAGVDIDAG NELVDRIKPA AKRTNRPGVM SGLGGFGALF DLKAAGYEDP ILVGATDGVG TKLRIAIDTG LVDGVGIDLV AMCVNDLVCQ GAEPLFFLDY FATGKLETDV AARIIEGIAE GCVRSGCALI GGETAEMPGM YPKGDFDLAG FAVGAMERGT ALPAGVSEGD VLLGLASDGV HSNGYSLVRQ IVKYSGLGWD GDNPFGEGKL GEALLTPTRL YVKQSLAAVR AGGVNALAHI TGGGLTENLP RVLPDDLGAD IDLGAWELPG VFKWMAQTGG IEESEMLKTF NCGIGMILVV KADRADALTE VLEGEGETVA RLGTVTRGEG IRYTGALL
|
| |