Gene Information |
Locus tag | TM1040_0603 |
Symbol | |
ID | 4078641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 643704 |
End bp | 644801 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638005900 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_612598 |
Protein GI | 99080444 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
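The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
START_BP, END_BP = 643704, 644801

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1098
print(protein_length(length_bp))  # 365
```

Both results agree with the Gene Length (1098 bp) and Protein Length (365 aa) fields, which is consistent with the inclusive-coordinate assumption.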
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.094732 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.303963 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGATC TCGACCAAAT CCTCGAACCC GGTGCCGTAA TCGGCATTCT GGGCGGTGGT CAGCTTGGCC GGATGCTGTC GGTTGCAGCC TCCCGCCTTG GCTTCACCAC CCACATCTAC GAGCCGGCCG CGAATCCGCC TGCGGGCCAT GTTGCGGATC GGGTCACAAC CGCTGCCTAT GAGGATGAGG CGGCCTTGCG CGCCTTTGCA CAAACCGTGG ATGTGATCAC CTATGAGTTT GAAAATATCC CCACCTCAGC GCTCGACATT CTCGAGGATC TGAAACCGAT CCGCCCCGGT CGCGAGGCGC TTCGGGTGTC GCAGGACCGC CTGACGGAAA AGACCTTTCT GCAAGAGCTT GGGCTGGCAA CAGCGCCCTT TGCAGAGGTG GATGATGCCG CCAGCCTTGA AGTTGCATTG GAAAAGGTCG GCACGCCTTC GATCCTGAAA ACGCGGCGTT TTGGCTATGA TGGCAAGGGT CAGATGCGCA TCATGCATCC CGAAGAAGCT CCGGAGGCCT TGGCCGCCAT GCAAGGAGCG CCCGCGGTGC TCGAAGGGTT TGTCCCCTTC AGCCACGAGG TCTCGGTCAT CGCAGCCCGC AGCCTCAATG GCAAAGTCGC CTGCTATGAT CCGGGCGAGA ACGTGCATCT CGAAGGCATC CTCAGCACCA CCACCGTGCC CGCAAACCTC ACGACGGGTC AGAGCATGGA TGCGGTGCTG ATGGCCGGTA AGATCCTGAA CGCCCTCGAT TACGTTGGTG TTCTGGCGGT CGAGATCTTT GTCACCGCGC ATGGCCTAGT GGTCAATGAA ATCGCCCCGC GCGTGCACAA CTCGGGTCAC TGGACGCAGG AAGGCTGCAC CATCGACCAG TTCGAACAGC ACATCCGGGC CATCACAGGC TGGACCATCG GCAATGGCGC CCGTTACGCC GATGTTGTGA TGGAAAACCT GATTGGCGAG GACGTGGGAC GTATCGCCGA TCTTGCACGC GATCCCGATT GCGCCATTCA TCTCTATGGC AAGGCCGAAA CAAAGCCTGG CCGCAAGATG GGCCATGTAA ACCGTGTCAT CCACACCGAA GGGCGCGAGG GCTACTGA
|
Protein sequence | MTDLDQILEP GAVIGILGGG QLGRMLSVAA SRLGFTTHIY EPAANPPAGH VADRVTTAAY EDEAALRAFA QTVDVITYEF ENIPTSALDI LEDLKPIRPG REALRVSQDR LTEKTFLQEL GLATAPFAEV DDAASLEVAL EKVGTPSILK TRRFGYDGKG QMRIMHPEEA PEALAAMQGA PAVLEGFVPF SHEVSVIAAR SLNGKVACYD PGENVHLEGI LSTTTVPANL TTGQSMDAVL MAGKILNALD YVGVLAVEIF VTAHGLVVNE IAPRVHNSGH WTQEGCTIDQ FEQHIRAITG WTIGNGARYA DVVMENLIGE DVGRIADLAR DPDCAIHLYG KAETKPGRKM GHVNRVIHTE GREGY
|
| |
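As a spot check, the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal illustration, not the pipeline used to produce this record. It builds the codon table by hand and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). Only the first 30 and last 18 bases of the gene sequence are translated. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 18 bases, copied from the Gene sequence field.
first_30 = "ATGACTGATC TCGACCAAAT CCTCGAACCC"
last_18 = "GGGCGCGAGG GCTACTGA"

print(translate(first_30))  # MTDLDQILEP
print(translate(last_18))   # GREGY
```

The two outputs match the first ten and last five residues of the protein sequence. The gene sequence begins with ATG, so it appears to be listed in the coding orientation already, even though the gene lies on the minus strand.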