Gene Information |
Locus tag | Nmul_A0943 |
Symbol | |
ID | 3785203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 1090857 |
End bp | 1094495 |
Gene Length | 3639 bp |
Protein Length | 1212 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637811026 |
Product | allophanate hydrolase subunit 2 |
Protein accession | YP_411638 |
Protein GI | 82702072 |
COG category | [C] Energy production and conversion [E] Amino acid transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG1038] Pyruvate carboxylase [COG2049] Allophanate hydrolase subunit 1 [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain [TIGR02712] urea carboxylase |
|
|
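The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the "Start bp" and "End bp" fields of this record.
length_bp = gene_length(1090857, 1094495)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values can be compared directly against the "Gene Length" and "Protein Length" fields.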
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.648902 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGAAA AAGTATTGAT AGCCAATCGC GGCGCCATTG CTTGTCGGAT TATCCGCACT CTACGGCGTA TGGGGGTAAA GAGCGTTGCC ATCTATACCG AAGCAGATGC GTTATCCCGG CATGTGATCG AAGCCGATGA AGCCTACTGC ATAGGCAGTG GAGTCGCCGC GGAAAGCTAT CTGCGCGCCG AAAAGATCCT GGAGGTTGCG AGCCATGCGG GGGCAAATGC AATTCATCCA GGATATGGCT TCCTCAGTGA GAAAGCTGAA TTTGCCGAGC AATGCGCTGA CCATGGTATT TCCTTCATTG GCCCCACTCC CCATCAGATG CGCGCGTTCG GCTTGAAGCA CACGGCGCGC AAACTGGCGC TGCAGAACCG GGTACCGCTG CTGCCGGGCA CAGGCTTGCT CGAAGACCTG GATGAGGCCT TGCGCCAGGC AGCTCATATC GGTTACCCGG TCATGCTGAA AAGCACGGCG GGAGGGGGCG GCATAGGCAT ACGTTTGTGC TGGAACAAGG AAGAGTTGAG TGCAAACTAC GAATTGGTGA AATACCTCGC GCAGAACAAT TTCAAGGACG CCGGCCTGTT TCTCGAAAAA TATGTGGAGA AGGCGCGCCA TATCGAGGTG CAGATATTCG GTGACGGCAA GGGTGGGGTG ATCGCACTGG GGGAGCGCGA CTGCTCGATG CAGCGGCGTA ACCAAAAGGT GATCGAAGAA ACCCCGGCCC CCGATCTGCC GCCACGCGTG CGCCAGGCGT TGCTGAATGC CGCGGTACGT TTGGGCAAGT CAGTCAATTA CCAGTCTGCG GGTACGGTGG AGTATATTTT CGATGCCTCC GCTGCAGAAT TTTATTTCCT GGAAGTGAAT ACGCGGCTGC AGGTCGAGCA CGGGGTAACT GAGGAAGTGA CCGGTATCGA CTTGGTGGAA TGGATGGTTC GGCAAGCCGC CGGAGACTTG CCCCCTCTCG ATTCATTCGA TATCCAGCCG CAGGGGGCCT CCCTCCAGGT ACGCGTGTAC GCCGAGAACG CGGTCAAAGA TTTTCAACCT TCCTGCGGTA TTCTCACCGC CGCAGAGTTT CCGCCTCCGT CCGCGGCACG CGTGGAAACC TGGGTGGAAC GCGGCGTCGA GGTCTCGCCA TTTTATGATC CTATGCTGGC AAAAATCATC GTGCATGCGC CGGACCGGGA GCAGGCCATT GCGCGGCTGC TGCAAGCCCT GGACGTGACG GCATTGCACG GAGTGGAAAC CAACGTCGGT TATCTCAGGC AGACTCTACG CAGCGATGCG TTTCGAAGCG CCCAACACAC GACCAGCTTT CTGAACACTT TTCGTTATGC CGCCCATACG ATCGATGTTC TCAGTCCGGG TGTACAGACT ACGGTGCAGG ATTATCCCGG GCGGACAGGA TACTGGAGTA TCGGCGTGCC GCCATCGGGA CCGATGGATG GTCTGGCGTT CCGCCTGGCC AATCGTCTCG TCGATAACAG CGAGGATAAG GCGGGGTTGG AGATTACGCT TTCCGGTCCG ACGCTGCACT TCAACTGTGA CAGCGTTATT GCGGTGTGCG GTGCGCCGAT GGAGGTGCGC CTGGATGGTG AGCCGCTTGC CTATTGGCGA GCGCATCGCG TCAAGGCCGG TTCGTTGTTG CAGTTCGGCA AACTCGTAAA TCATGGGTGC CGCGCTTATC TGGCGGTGCA AGGAGGAATT CGGGTTCCCG ATTATCTGGG CAGCAAATCC ACATTCACCC TCGGGCACTT CGGGGGGCAT GCAGGCCGCA CGCTCCTCAC CGGCGACGTG 
CTGCATATCT TTGAAGCCAG AAAAGACGGT AATGGCGGAT TTGAGCAGGA ACTGCCGGAT GAATTGGTAC CGCCCTACAC CGACAGCTGG AAAATAGGGG TGTTGTACGG GCCGCACGGG GCGCCGGATT TTTTCACGGA GCAGGATATC GAAACCTTCT TCGCCACGGA TTGGGAAGTG CATTACAACT CCAATCGCAC CGGCGTGCGG CTGATCGGCC CCAAGCCCCA TTGGGCGCGC AGTGACGGCG GCGAAGCAGG ACTGCATCCC TCCAATATCC ACGACAACGC CTATGCCGTA GGTACCGTGG ATTTTACGGG AGATATGCCT GTGATACTTG GTCCCGATGG CCCCAGCCTT GGCGGCTTCG TGTGCCCGGT TACCATCATC CAGGCTGAAT TTTGGAAGAT GGGACAGCTC AAGCCGGGTG ATCGCGTGCG CTTCCACAGG ATGTCGATGG AGCAGGCGCT GGGGCTGGAG TTGCAGCAGG ATGCAAGGAT AAAAGACCTC CGGGTCCCGC AAAACGTTTC ATCCGAGGGG GAGCAGGACG CAACCGACGC ATCCATGCCT GTGCTCCACT TCATCCCGCA AAGCGACGGC CATGTACAGG TGGTATATCG CCAGGCAGGC GATAAAAATC TGCTGGTCGA GTATGGTCCC CTCGAGCTCG ATCTCAATTT GCGCTTCCGT GCGCACGCCC TGATGGATTG GGTGCAGAAA ACGTGCAATG ACGGAGAACT GAAAGGCATT CTGGATCTGA CACCCGGCAT CCGTTCGCTG CAAGTGCATT TCGATTCCCG CGTACTCCCA CGCGATAAGC TGCTGGAAAT GCTGGTCAGC GCGGAAAAGA AATTACCCGA TATCGATGAT ATGGAGGTCC CGGCACGCGT CGTCCATCTT CCACTGTCAT GGGACGATGG TGCCACCCGC CTGGCGATAG AAAAGTACAT GCAGTCGGTG CGCAGCGATG CACCCTGGTG CCCCAGCAAT ATCGAGTTCA TCCGCCGGAT CAACGGCCTC GACAGCATCG AGGAAGTGCA GCACATCCTG TTTTCTGCGA ATTATCTTGT CATGGGACTG GGTGACGTCT ATCTCGGTGC GCCGGTTGCC ACGCCCGTGG ATCCGCGTCA TCGGCTGGTG ACCACGAAGT ATAATCCTGC GCGTACCTGG ACGCCCGAGA ATGCGGTTGG CATAGGGGGG GCCTATTTGT GCATATACGG GATGGAAGGT CCCGGAGGTT ACCAGTTTGT GGGCCGCACG GTGCAGATGT GGAACCGGTA CCTCCAGACA GCCGACTTCA AGGAAGGAAA ACCCTGGCTG CTGCGTTTCT TTGACCAGAT TCGTTTCTAT CCGGTTAGCG AAAGCGAACT GCTCAAGTTG CGCAAGGATT TCATTACCGG ACACTTCAAG CTGAAGATCG AGGAGACAAC ATTCAGTTTG AAACAGTACA ACGCTTTCCT GAAAGAAAAT GCGGGATCCA TCAGCGCTTT CAAAGCAAAG CAGCAGGCTG CGTTTGAAGC CGAGCGTGAA CGCTGGAAGG CCCAAGGCAA GGCTGAGTAC GTGAGCGAGG TTACACTCGA GGAAGCAGAT GCGCAGAGCG AACTGGATTT GCCCGCTGAT TCCCAGATTG TCAGTGCGCA TGTAACCGGC ACGGTATGGA AACTGCTCGT CAAGGAAGGG CAGCGTGTCG AAACGGGAGA TCCAGTGGTA GTGGTGGAGT CCATGAAAAT GGAATTCTCT GTGGAGACAC CGGTCAGCGG TAGGGTACGA CAGCTATTCT GCAAGGAGGG GAGCCATATA TCCGCCGGGC 
AGATGTTGCT TATCGTTCAG GAGGAATGA
|
Protein sequence | MFEKVLIANR GAIACRIIRT LRRMGVKSVA IYTEADALSR HVIEADEAYC IGSGVAAESY LRAEKILEVA SHAGANAIHP GYGFLSEKAE FAEQCADHGI SFIGPTPHQM RAFGLKHTAR KLALQNRVPL LPGTGLLEDL DEALRQAAHI GYPVMLKSTA GGGGIGIRLC WNKEELSANY ELVKYLAQNN FKDAGLFLEK YVEKARHIEV QIFGDGKGGV IALGERDCSM QRRNQKVIEE TPAPDLPPRV RQALLNAAVR LGKSVNYQSA GTVEYIFDAS AAEFYFLEVN TRLQVEHGVT EEVTGIDLVE WMVRQAAGDL PPLDSFDIQP QGASLQVRVY AENAVKDFQP SCGILTAAEF PPPSAARVET WVERGVEVSP FYDPMLAKII VHAPDREQAI ARLLQALDVT ALHGVETNVG YLRQTLRSDA FRSAQHTTSF LNTFRYAAHT IDVLSPGVQT TVQDYPGRTG YWSIGVPPSG PMDGLAFRLA NRLVDNSEDK AGLEITLSGP TLHFNCDSVI AVCGAPMEVR LDGEPLAYWR AHRVKAGSLL QFGKLVNHGC RAYLAVQGGI RVPDYLGSKS TFTLGHFGGH AGRTLLTGDV LHIFEARKDG NGGFEQELPD ELVPPYTDSW KIGVLYGPHG APDFFTEQDI ETFFATDWEV HYNSNRTGVR LIGPKPHWAR SDGGEAGLHP SNIHDNAYAV GTVDFTGDMP VILGPDGPSL GGFVCPVTII QAEFWKMGQL KPGDRVRFHR MSMEQALGLE LQQDARIKDL RVPQNVSSEG EQDATDASMP VLHFIPQSDG HVQVVYRQAG DKNLLVEYGP LELDLNLRFR AHALMDWVQK TCNDGELKGI LDLTPGIRSL QVHFDSRVLP RDKLLEMLVS AEKKLPDIDD MEVPARVVHL PLSWDDGATR LAIEKYMQSV RSDAPWCPSN IEFIRRINGL DSIEEVQHIL FSANYLVMGL GDVYLGAPVA TPVDPRHRLV TTKYNPARTW TPENAVGIGG AYLCIYGMEG PGGYQFVGRT VQMWNRYLQT ADFKEGKPWL LRFFDQIRFY PVSESELLKL RKDFITGHFK LKIEETTFSL KQYNAFLKEN AGSISAFKAK QQAAFEAERE RWKAQGKAEY VSEVTLEEAD AQSELDLPAD SQIVSAHVTG TVWKLLVKEG QRVETGDPVV VVESMKMEFS VETPVSGRVR QLFCKEGSHI SAGQMLLIVQ EE
|
| |
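The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any calculation. The sketch below shows one way to strip the whitespace and compute a GC fraction; it uses only the first three blocks of the gene sequence as sample input, and the helper names `clean_sequence` and `gc_fraction` are hypothetical. The result for this short fragment is not expected to match the whole-gene "GC content" field.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three ten-base blocks of the gene sequence in this record.
snippet = "ATGTTTGAAA AAGTATTGAT AGCCAATCGC"
seq = clean_sequence(snippet)
print(len(seq), "bases; GC fraction:", round(gc_fraction(seq), 3))
```

Passing the full gene sequence, pasted as one string, through `clean_sequence` first would allow the same function to be compared against the reported GC content.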