Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nmul_A0963 |
Symbol | |
ID | 3785754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 1118067 |
End bp | 1119407 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637811046 |
Product | amidohydrolase |
Protein accession | YP_411658 |
Protein GI | 82702092 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.296396 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGCG TCTGCCTGAA AATCACCAGC CTCATCAGCC TCGCCAGCCG AATCACGACA GGTTTATCGT TGATGTCGGT CATGCTGGCC TCCCCTCTGG CTGTTGCCCA GATGGGGAAG CACGACGCGC TTCTCCTGCA TGCCGCACGG GTTTTCGACG GCCACAGGAT GAGGACAAAC ACATCGGTAC AGGTAGTAAA CGGAAAGGTA ACGCAAATCG GCCCACGCGG CTCATTCGCC CCCGATAATG CAAAAGTCAT CGATCTGGGG GACGCGACGA TACTGCCCGG TTTTATCGAA CTGCATGCCC ATCTTTCGTT CCAGCATGTT CCTGCCGATA CTGTATTGAA GCATGGGATT ACCACAATCC GGGACGTAGG CGGACCGGTA CACAAGCCCT ATGGCGGAAA TGGCACCCTG CGCATGCTTA CCTCGGGACC CATCATCACC GCACCTGGCG GTTATCCTAT CCCGCTGACA GGGGAAAAGA ATATCGCAAC ACCGGTATCC ACGGAGAAAG AAGCAAGAGA AACAGTCCGA AACCTGATCG ATGCGGGCGC GGTGGTGATC AAGATTGCCC TGGAACCCGG CGGCGAGCCG GGCGCGCCGT GGTCGTGCGG CCATGCGCAT GTGCATCACC ACAACCATGC GGCCGCCGGT AATCATGCTC ATCATGCACA GCCCAATCAC GAACCGCCAA ACCACGAAGC GGCAAACGAC GCCCATTCCT CTCACCCACA TCCCAATACT GGTCTCGGTA CTGCCTGGCC CCTGCTCGCT GAAAGAGTCG TAAAGGCGAT AGTGGACGAA GCCCATGTGA ACAAACGCAG GGTAACGGCC CATATCGCCG AAGTAAAAGG CGCGGAGATT GCCATCAACG CGGGAATCGA TGAGTGGGCT CATGTTCCCT GCGACATCAT CCCCGAGCCT CTGTTAAAAA AAGCCGTGTC ACAGGGTGTC AGAATAGTTA CGACGCTCGA TACCCTGTCG AAATGTCCCG GTGTTGCGCA CAATACCCGG GTCTGGCGGG AACAGGGTGG CGAACTGCTG TACGGCGCTG AAATCGCACA CCCGGATGTT CCCAGGGGAA TCGACGCCCA AGAGCTTATG TACATGATGC AGATGGGAAA TATGGAAACC CTGGAGGTCC TGCGCGCGGC CACGTCGAAA GCGGGGGAAC ACCTCGGCTT GCCCCTTCTC GGCACCCTTC TGCCCGGCGC GCCCGCCGAC GTGATCGCCA TCAAGGGGGA TCCCACCCAT AACCTGAAAA ATCTGGAATA CCCGGATCTG GTTATTTCGG GCGGAGAAAT TGTTTTGAAT AATTTTCCAG CTGCTTTCTA G
|
Protein sequence | MKSVCLKITS LISLASRITT GLSLMSVMLA SPLAVAQMGK HDALLLHAAR VFDGHRMRTN TSVQVVNGKV TQIGPRGSFA PDNAKVIDLG DATILPGFIE LHAHLSFQHV PADTVLKHGI TTIRDVGGPV HKPYGGNGTL RMLTSGPIIT APGGYPIPLT GEKNIATPVS TEKEARETVR NLIDAGAVVI KIALEPGGEP GAPWSCGHAH VHHHNHAAAG NHAHHAQPNH EPPNHEAAND AHSSHPHPNT GLGTAWPLLA ERVVKAIVDE AHVNKRRVTA HIAEVKGAEI AINAGIDEWA HVPCDIIPEP LLKKAVSQGV RIVTTLDTLS KCPGVAHNTR VWREQGGELL YGAEIAHPDV PRGIDAQELM YMMQMGNMET LEVLRAATSK AGEHLGLPLL GTLLPGAPAD VIAIKGDPTH NLKNLEYPDL VISGGEIVLN NFPAAF
|
| |