Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0611 |
Symbol | |
ID | 3786686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 691745 |
End bp | 692965 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637810693 |
Product | phosphoribosylaminoimidazole carboxylase ATPase subunit |
Protein accession | YP_411310 |
Protein GI | 82701744 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0585847 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTACGC GGCCTGGGGC GATGCTGGGG TTGCTGGGCG GCGGCCAGCT CGGCAGGATG TTCACCATGG CGGCCCAAAG CCTGGGTTAC AGCGTGACCG TTCTTGATCC CGAGGAGCGG AGCCCGGCGG GTAGCATCGC TGAGCGGCAC CTGCGCGCCG ATTACCTCGA TCGCGAGGCA TTGAATGAGC TCGCATCCAC CTGCGCAGGC GTCACCACCG AATTCGAAAA CGTACCTGCC GAGGCACTCA GAGTACTGGC TGAACGCTGC ATCGTCAGTC CCGCTGCCGA GAGTGTCGCC ATCGCGCAAG ATCGCATTCT GGAGAAGAAT TTTCTCGCTG CCAACGGCTT TGGGGTCGCG CCTTATACAG TGATACAGAA CGCGCGCACT TCGCAGGATG CGACCAATCC GGATGGGGCA TTACCCCAGA GGCAACCGCA ATTTTTTCCT GACCTCTTCC CCGGAATTCT CAAGGTAAGC CGGTTTGGAT ATGACGGCAA AGGCCAGGTG CGTGTAAGTA ATGCATCAGA GCTTGACAGC GCATTTGCAG GTTTGAACCG GGAATCCTGC GTCCTGGAAA AACTGCTGCC CCTGGAGCGC GAGGTATCCG TCATCGTATC CAGGGGATTT GATGGCGAAG TGGTTACTTT TCCCGTATCC GAGAATCAGC ACTGTAACGG TATCCTCGAC ATCAGCATCG TTCCCGCCAG AGTATCGCCT GAAATTGCCC GAAACGCCTG CGATATTGCG ATTCGCATTG CCGAGAAGCT GGATTATCGC GGCGTATTAT GCGTCGAGTT TTTTGTGCTC GCGGAGGGAC GCCTGCTGGT GAACGAGATC GCTCCTCGTC CTCATAATAG CGGGCATTAC TCGATCAATG CCTGCGTTAC CTCTCAATTC GAGCAGCAGG TGCGCATACT TTGCAGAATG CCCCTCGGCA GTACTTCCAT GCATGGAGCG GCGGTAATGG TGAATCTGCT GGGCGATCTC TGGCAGAAGG GAGAGCCGGA GTGGGAACAG GTGCTGCGGC ATTCCGACGT CAAATTGCAT TTATACGGCA AGCGCGATGC CAGACCCGGA CGGAAAATGG GGCACTACAC GGTATTGGCA GAAACGACCG ACGCCGCGTT GCGGCTGGCG CTGGAAATCA AGCAATCGCT GCAACCCTCT CATAGTACTC TTGAAACACA TGAGTCTCCC CGGAACGAAA GTCGAAACTG A
|
Protein sequence | MSTRPGAMLG LLGGGQLGRM FTMAAQSLGY SVTVLDPEER SPAGSIAERH LRADYLDREA LNELASTCAG VTTEFENVPA EALRVLAERC IVSPAAESVA IAQDRILEKN FLAANGFGVA PYTVIQNART SQDATNPDGA LPQRQPQFFP DLFPGILKVS RFGYDGKGQV RVSNASELDS AFAGLNRESC VLEKLLPLER EVSVIVSRGF DGEVVTFPVS ENQHCNGILD ISIVPARVSP EIARNACDIA IRIAEKLDYR GVLCVEFFVL AEGRLLVNEI APRPHNSGHY SINACVTSQF EQQVRILCRM PLGSTSMHGA AVMVNLLGDL WQKGEPEWEQ VLRHSDVKLH LYGKRDARPG RKMGHYTVLA ETTDAALRLA LEIKQSLQPS HSTLETHESP RNESRN
|
| |