Gene Information |
Locus tag | Shewmr4_2976 |
Symbol | |
ID | 4253547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 3555062 |
End bp | 3556159 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 638119612 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_735104 |
Protein GI | 113971311 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
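
The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in Protein Length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced only for this example.

```python
# Values copied from the Gene Information table above.
START_BP = 3555062
END_BP = 3556159


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1098, matching "Gene Length"
print(protein_length(length_bp))  # 365, matching "Protein Length"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length rows of the table.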
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000387577 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCAAG CAAGTGCAAA ACCTAAGGTT TGGGTATTAG GTAATGGTCA GCTCGGCGCC ATGCTAACTC ACGCTGGCGA GCCGCTGGCT ATTGATGTTC GCGCCGTCGA TATCATGACG CCCACCGATG AAATCCTGCC GCTGGCACCT CACGATATCA TCACCGCCGA ACGCGAGCAG TGGCCCGAAT CCGCCCTAAG CTTACAACTC AGCACCCATC CGCATTTTGT TAACGGCCCA GTATTTAGCC GTTTAGCTGA CCGTTATAGC CAAAAAAGCT TACTCGATCA GATCCAAGTC CCAACCGCGC CTTGGACACT CGTAGATGAT AAGACCGACG CCGAAAACCT ATACCAAGCC TTTGGTCCTC GCGTGCTGAT GAAGCGCCGC ACGGGTGGCT ACGACGGTAA GGGTCAACAT TGGCTAAAAC AAGCCGAAGC GGGAGAGATC CCCCAAGATT GGCGTAACTT GGCCATTGCC GAGCAAGCAA TTAACTTCGA TGAAGAAGTG TCCTTAGTCG GCGTACGTAC CCGTGAAGGT CAATGCGTGT TTTATCCATT AACACTTAAC CTGCATCAAG ACGGCATTTT GATGGCATCG ATTTCGCCAC TGACTCGTCT GAATCACCTG CAAGCACAAG CGGAAGGCAT GCTGAGCGCT ATTATGCATG AGCTGGAATA TGTCGGCGTG ATGGCGATGG AATGTTTCCG AGTTGGCGAT AAGCTGCTGG TTAACGAGTT AGCCCCGCGG GTGCACAACT CTGGCCATTG GACCCAAGCG GGTACCCATA TGGATCAATT CCAACTGCAT TTAAGAGCCC TGTGCGGCAT TGCTATTCCG CAGCCACAGG TAAACTTTCA ATGCGTGATG GTCAACCTGA TTGGTATCGA TAACGATACC CGTTGGTTAA GCCTGCCTAA TGCCGAGCTT TATTGGTACA ACAAAGAAGT GCGATCTGGC CGTAAAGTCG GGCACTTAAA TCTTTCGGTA CCTAATCTTG CCGTGTTAAA AGACAGCATT GGCGCACTAC AAACTTGGAT GCCAAGCCAA TATCAAGCGC CCCTCGCGTG GATTTTAGCA GAGTTTGCTA AAGACTAA
|
Protein sequence | MTQASAKPKV WVLGNGQLGA MLTHAGEPLA IDVRAVDIMT PTDEILPLAP HDIITAEREQ WPESALSLQL STHPHFVNGP VFSRLADRYS QKSLLDQIQV PTAPWTLVDD KTDAENLYQA FGPRVLMKRR TGGYDGKGQH WLKQAEAGEI PQDWRNLAIA EQAINFDEEV SLVGVRTREG QCVFYPLTLN LHQDGILMAS ISPLTRLNHL QAQAEGMLSA IMHELEYVGV MAMECFRVGD KLLVNELAPR VHNSGHWTQA GTHMDQFQLH LRALCGIAIP QPQVNFQCVM VNLIGIDNDT RWLSLPNAEL YWYNKEVRSG RKVGHLNLSV PNLAVLKDSI GALQTWMPSQ YQAPLAWILA EFAKD
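
The sequences above are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate the nucleotide sequence. It is an illustration under stated assumptions, not the method used by the database: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard NCBI amino acid assignment, which translation table 11 shares for internal codons. Only the first 30 nucleotides of the gene sequence are used here to keep the example short.

```python
# Standard NCBI codon ordering: bases T, C, A, G at each of three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(displayed: str) -> str:
    """Remove the whitespace used for display and upper-case the result."""
    return "".join(displayed.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three display blocks of the gene sequence above.
prefix = clean_sequence("ATGACTCAAG CAAGTGCAAA ACCTAAGGTT")
print(translate(prefix))  # MTQASAKPKV
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` should give a value comparable to the GC content row, although the rounding the database applies is not stated in the record.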