Gene Nmul_A0943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0943 
Symbol 
ID3785203 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1090857 
End bp1094495 
Gene Length3639 bp 
Protein Length1212 aa 
Translation table11 
GC content57% 
IMG OID637811026 
Productallophanate hydrolase subunit 2 
Protein accessionYP_411638 
Protein GI82702072 
COG category[C] Energy production and conversion
[E] Amino acid transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1038] Pyruvate carboxylase
[COG2049] Allophanate hydrolase subunit 1
[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain
[TIGR02712] urea carboxylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.648902 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGAAA AAGTATTGAT AGCCAATCGC GGCGCCATTG CTTGTCGGAT TATCCGCACT 
CTACGGCGTA TGGGGGTAAA GAGCGTTGCC ATCTATACCG AAGCAGATGC GTTATCCCGG
CATGTGATCG AAGCCGATGA AGCCTACTGC ATAGGCAGTG GAGTCGCCGC GGAAAGCTAT
CTGCGCGCCG AAAAGATCCT GGAGGTTGCG AGCCATGCGG GGGCAAATGC AATTCATCCA
GGATATGGCT TCCTCAGTGA GAAAGCTGAA TTTGCCGAGC AATGCGCTGA CCATGGTATT
TCCTTCATTG GCCCCACTCC CCATCAGATG CGCGCGTTCG GCTTGAAGCA CACGGCGCGC
AAACTGGCGC TGCAGAACCG GGTACCGCTG CTGCCGGGCA CAGGCTTGCT CGAAGACCTG
GATGAGGCCT TGCGCCAGGC AGCTCATATC GGTTACCCGG TCATGCTGAA AAGCACGGCG
GGAGGGGGCG GCATAGGCAT ACGTTTGTGC TGGAACAAGG AAGAGTTGAG TGCAAACTAC
GAATTGGTGA AATACCTCGC GCAGAACAAT TTCAAGGACG CCGGCCTGTT TCTCGAAAAA
TATGTGGAGA AGGCGCGCCA TATCGAGGTG CAGATATTCG GTGACGGCAA GGGTGGGGTG
ATCGCACTGG GGGAGCGCGA CTGCTCGATG CAGCGGCGTA ACCAAAAGGT GATCGAAGAA
ACCCCGGCCC CCGATCTGCC GCCACGCGTG CGCCAGGCGT TGCTGAATGC CGCGGTACGT
TTGGGCAAGT CAGTCAATTA CCAGTCTGCG GGTACGGTGG AGTATATTTT CGATGCCTCC
GCTGCAGAAT TTTATTTCCT GGAAGTGAAT ACGCGGCTGC AGGTCGAGCA CGGGGTAACT
GAGGAAGTGA CCGGTATCGA CTTGGTGGAA TGGATGGTTC GGCAAGCCGC CGGAGACTTG
CCCCCTCTCG ATTCATTCGA TATCCAGCCG CAGGGGGCCT CCCTCCAGGT ACGCGTGTAC
GCCGAGAACG CGGTCAAAGA TTTTCAACCT TCCTGCGGTA TTCTCACCGC CGCAGAGTTT
CCGCCTCCGT CCGCGGCACG CGTGGAAACC TGGGTGGAAC GCGGCGTCGA GGTCTCGCCA
TTTTATGATC CTATGCTGGC AAAAATCATC GTGCATGCGC CGGACCGGGA GCAGGCCATT
GCGCGGCTGC TGCAAGCCCT GGACGTGACG GCATTGCACG GAGTGGAAAC CAACGTCGGT
TATCTCAGGC AGACTCTACG CAGCGATGCG TTTCGAAGCG CCCAACACAC GACCAGCTTT
CTGAACACTT TTCGTTATGC CGCCCATACG ATCGATGTTC TCAGTCCGGG TGTACAGACT
ACGGTGCAGG ATTATCCCGG GCGGACAGGA TACTGGAGTA TCGGCGTGCC GCCATCGGGA
CCGATGGATG GTCTGGCGTT CCGCCTGGCC AATCGTCTCG TCGATAACAG CGAGGATAAG
GCGGGGTTGG AGATTACGCT TTCCGGTCCG ACGCTGCACT TCAACTGTGA CAGCGTTATT
GCGGTGTGCG GTGCGCCGAT GGAGGTGCGC CTGGATGGTG AGCCGCTTGC CTATTGGCGA
GCGCATCGCG TCAAGGCCGG TTCGTTGTTG CAGTTCGGCA AACTCGTAAA TCATGGGTGC
CGCGCTTATC TGGCGGTGCA AGGAGGAATT CGGGTTCCCG ATTATCTGGG CAGCAAATCC
ACATTCACCC TCGGGCACTT CGGGGGGCAT GCAGGCCGCA CGCTCCTCAC CGGCGACGTG
CTGCATATCT TTGAAGCCAG AAAAGACGGT AATGGCGGAT TTGAGCAGGA ACTGCCGGAT
GAATTGGTAC CGCCCTACAC CGACAGCTGG AAAATAGGGG TGTTGTACGG GCCGCACGGG
GCGCCGGATT TTTTCACGGA GCAGGATATC GAAACCTTCT TCGCCACGGA TTGGGAAGTG
CATTACAACT CCAATCGCAC CGGCGTGCGG CTGATCGGCC CCAAGCCCCA TTGGGCGCGC
AGTGACGGCG GCGAAGCAGG ACTGCATCCC TCCAATATCC ACGACAACGC CTATGCCGTA
GGTACCGTGG ATTTTACGGG AGATATGCCT GTGATACTTG GTCCCGATGG CCCCAGCCTT
GGCGGCTTCG TGTGCCCGGT TACCATCATC CAGGCTGAAT TTTGGAAGAT GGGACAGCTC
AAGCCGGGTG ATCGCGTGCG CTTCCACAGG ATGTCGATGG AGCAGGCGCT GGGGCTGGAG
TTGCAGCAGG ATGCAAGGAT AAAAGACCTC CGGGTCCCGC AAAACGTTTC ATCCGAGGGG
GAGCAGGACG CAACCGACGC ATCCATGCCT GTGCTCCACT TCATCCCGCA AAGCGACGGC
CATGTACAGG TGGTATATCG CCAGGCAGGC GATAAAAATC TGCTGGTCGA GTATGGTCCC
CTCGAGCTCG ATCTCAATTT GCGCTTCCGT GCGCACGCCC TGATGGATTG GGTGCAGAAA
ACGTGCAATG ACGGAGAACT GAAAGGCATT CTGGATCTGA CACCCGGCAT CCGTTCGCTG
CAAGTGCATT TCGATTCCCG CGTACTCCCA CGCGATAAGC TGCTGGAAAT GCTGGTCAGC
GCGGAAAAGA AATTACCCGA TATCGATGAT ATGGAGGTCC CGGCACGCGT CGTCCATCTT
CCACTGTCAT GGGACGATGG TGCCACCCGC CTGGCGATAG AAAAGTACAT GCAGTCGGTG
CGCAGCGATG CACCCTGGTG CCCCAGCAAT ATCGAGTTCA TCCGCCGGAT CAACGGCCTC
GACAGCATCG AGGAAGTGCA GCACATCCTG TTTTCTGCGA ATTATCTTGT CATGGGACTG
GGTGACGTCT ATCTCGGTGC GCCGGTTGCC ACGCCCGTGG ATCCGCGTCA TCGGCTGGTG
ACCACGAAGT ATAATCCTGC GCGTACCTGG ACGCCCGAGA ATGCGGTTGG CATAGGGGGG
GCCTATTTGT GCATATACGG GATGGAAGGT CCCGGAGGTT ACCAGTTTGT GGGCCGCACG
GTGCAGATGT GGAACCGGTA CCTCCAGACA GCCGACTTCA AGGAAGGAAA ACCCTGGCTG
CTGCGTTTCT TTGACCAGAT TCGTTTCTAT CCGGTTAGCG AAAGCGAACT GCTCAAGTTG
CGCAAGGATT TCATTACCGG ACACTTCAAG CTGAAGATCG AGGAGACAAC ATTCAGTTTG
AAACAGTACA ACGCTTTCCT GAAAGAAAAT GCGGGATCCA TCAGCGCTTT CAAAGCAAAG
CAGCAGGCTG CGTTTGAAGC CGAGCGTGAA CGCTGGAAGG CCCAAGGCAA GGCTGAGTAC
GTGAGCGAGG TTACACTCGA GGAAGCAGAT GCGCAGAGCG AACTGGATTT GCCCGCTGAT
TCCCAGATTG TCAGTGCGCA TGTAACCGGC ACGGTATGGA AACTGCTCGT CAAGGAAGGG
CAGCGTGTCG AAACGGGAGA TCCAGTGGTA GTGGTGGAGT CCATGAAAAT GGAATTCTCT
GTGGAGACAC CGGTCAGCGG TAGGGTACGA CAGCTATTCT GCAAGGAGGG GAGCCATATA
TCCGCCGGGC AGATGTTGCT TATCGTTCAG GAGGAATGA
 
Protein sequence
MFEKVLIANR GAIACRIIRT LRRMGVKSVA IYTEADALSR HVIEADEAYC IGSGVAAESY 
LRAEKILEVA SHAGANAIHP GYGFLSEKAE FAEQCADHGI SFIGPTPHQM RAFGLKHTAR
KLALQNRVPL LPGTGLLEDL DEALRQAAHI GYPVMLKSTA GGGGIGIRLC WNKEELSANY
ELVKYLAQNN FKDAGLFLEK YVEKARHIEV QIFGDGKGGV IALGERDCSM QRRNQKVIEE
TPAPDLPPRV RQALLNAAVR LGKSVNYQSA GTVEYIFDAS AAEFYFLEVN TRLQVEHGVT
EEVTGIDLVE WMVRQAAGDL PPLDSFDIQP QGASLQVRVY AENAVKDFQP SCGILTAAEF
PPPSAARVET WVERGVEVSP FYDPMLAKII VHAPDREQAI ARLLQALDVT ALHGVETNVG
YLRQTLRSDA FRSAQHTTSF LNTFRYAAHT IDVLSPGVQT TVQDYPGRTG YWSIGVPPSG
PMDGLAFRLA NRLVDNSEDK AGLEITLSGP TLHFNCDSVI AVCGAPMEVR LDGEPLAYWR
AHRVKAGSLL QFGKLVNHGC RAYLAVQGGI RVPDYLGSKS TFTLGHFGGH AGRTLLTGDV
LHIFEARKDG NGGFEQELPD ELVPPYTDSW KIGVLYGPHG APDFFTEQDI ETFFATDWEV
HYNSNRTGVR LIGPKPHWAR SDGGEAGLHP SNIHDNAYAV GTVDFTGDMP VILGPDGPSL
GGFVCPVTII QAEFWKMGQL KPGDRVRFHR MSMEQALGLE LQQDARIKDL RVPQNVSSEG
EQDATDASMP VLHFIPQSDG HVQVVYRQAG DKNLLVEYGP LELDLNLRFR AHALMDWVQK
TCNDGELKGI LDLTPGIRSL QVHFDSRVLP RDKLLEMLVS AEKKLPDIDD MEVPARVVHL
PLSWDDGATR LAIEKYMQSV RSDAPWCPSN IEFIRRINGL DSIEEVQHIL FSANYLVMGL
GDVYLGAPVA TPVDPRHRLV TTKYNPARTW TPENAVGIGG AYLCIYGMEG PGGYQFVGRT
VQMWNRYLQT ADFKEGKPWL LRFFDQIRFY PVSESELLKL RKDFITGHFK LKIEETTFSL
KQYNAFLKEN AGSISAFKAK QQAAFEAERE RWKAQGKAEY VSEVTLEEAD AQSELDLPAD
SQIVSAHVTG TVWKLLVKEG QRVETGDPVV VVESMKMEFS VETPVSGRVR QLFCKEGSHI
SAGQMLLIVQ EE