Gene Nmul_A0141 details


Gene Information

Locus tag: Nmul_A0141
Symbol:
ID: 3784113
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 147423
End bp: 149114
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 59%
IMG OID: 637810212
Product: carbamoyltransferase
Protein accession: YP_410842
Protein GI: 82701276
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG2192] Predicted carbamoyl transferase, NodU family
TIGRFAM ID:
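
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(147423, 149114)
print(length_bp)                  # 1692
print(protein_length(length_bp))  # 563
```

Both results agree with the Gene Length and Protein Length fields listed in the record.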


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAGATG GACCGGTAGT GCTTGGGGTT AACCGCACGC AGGATGCAAG CATCTGTCTG 
ATGCATGGAT CACAGCTCAT ATACGCAATA CAAAAGGAAC GGCTCACAAG GCGCAAGCAC
CACTGGGGCA GGCTCGACGA TTTCCGTAAT GCATACGTGC CGCATCTTCC CGATCTCGGC
AAGCCGATCG ATGTCCTTGT CGAATGCTAT TCATCCGACA GCGAGATAAA AAATTTTCCT
GCCTATGAGC GGGAGATGGC CGAGACGCTG AAGTTCGCCC CCGGCTGCCG GCGCAAGCGC
ATCTCGCACC ATCTCGCCCA TGTGTACAGC GTATTCCATC CGTCGCCATT TGAAGAGGCT
GCGGTGATGA TCATCGATGG CCAGGGCAGT CCGGTTTCCG AATTCACTGA GAAATGGAGC
GGGTCGGACA GTGTGCCTCA GGATTGGCGT GAAGTCTCCT CATTCTATCG GGCAGACAGA
GGGCGAATCG AATGCATCGG CAAGCAATTG TGGGACCGTA ACGAGCGATG CCTGGTTGGG
CTCGGAATGT TTTACTTCCT GCTGACACAG GCGGTTTTTC CAGGCGAGGG CAACGAAGGC
AAAGTGATGG GACTTGCCCC GCACGGCGAC CCGAATGCGC TCGGTCTGCC GCCCCTGCAG
GTGGAGGGTC CCCAGGTGAC GATTCCCGGT CGCTGGATGG AAATCCTGCG CGATCGCAGC
CGTTTTCGCT ATGCTGCCGA TGATACTTCC CGTTTTGCCG ATAGTGCCAA TCTGTCTGCT
GCCGGGCAGC GCGCATTCGA GGAGGCAGTG CTCGAAGTTG CCCGCTGGCT GCACGCGCAA
ACGGGCGCCG AGAACTTATG CTTTGCCGGA GGCACCGGTC TTAACTGCTC CACCAATGAT
CGCCTCTTGC GTGAAACCCC TTTCCAGCGC GTCTTCATTC CTCCGGCCCC GAGCGATGCA
GGCACTTCGC TTGGTTGCGC CGTGTACGGC CTTACCGAAG TGGGGGGAAT GCGTTGCGAC
TATAGATGGG AGAACGACTA TCTCGGGCCC GAGCCGCATC TGTCAGACAT CGAGTCCGTG
TTAAACGGGG CGGATGACCT TGTCGTGGAG CATATAGAAC AATCTGCCGA CTTGTGCGGA
CGTATAGCCG ACCTTCTGGC TGACAGCAAG GTGGTCGCGC TCTATCACGG GCGCAGCGAA
TTCGGCCCGC GCGCACTCGG CCACCGCAGT ATTCTTGGAG ATCCGCGCCA TGGCTATGTG
CGTGACTGGA TCAACGCCAG AGTCAAGGAG CGGGAATGGT TTCGTCCGCT GGCGCCGGTA
GTGCTGCTCG AGCAGGCGGA GGAATTCTTC GATATCCGCC GTCCATCCCC CTTCATGCAG
TTCGCCGCGC CGGTCTGGCC CAAGGCCGCC GACATCATCC CCGCTGTCAC GCATGTCGAC
TGCACGGCCC GGTTGCAAAC CGTTGGCGAA CAGGATGACC CCTTCCTGCG CACCCTGCTG
AAAGCATTTG AAGCCCGTAC CGGTGTTCCC GTAGTGCTCA ATACATCCTT CAACCGGAAG
GAAGAGCCCA TCGTTGAAAC GCCTGCCCAG GCGCTTGAGT CATTTCGCCG CACGCCCATG
CACGCACTCG CCATGCCGCC CTATCTCGTG CGCAAGCGTA TCGAACCCGA GGCGGTCACT
CCTGTCGGGT AG
 
Protein sequence
MGDGPVVLGV NRTQDASICL MHGSQLIYAI QKERLTRRKH HWGRLDDFRN AYVPHLPDLG 
KPIDVLVECY SSDSEIKNFP AYEREMAETL KFAPGCRRKR ISHHLAHVYS VFHPSPFEEA
AVMIIDGQGS PVSEFTEKWS GSDSVPQDWR EVSSFYRADR GRIECIGKQL WDRNERCLVG
LGMFYFLLTQ AVFPGEGNEG KVMGLAPHGD PNALGLPPLQ VEGPQVTIPG RWMEILRDRS
RFRYAADDTS RFADSANLSA AGQRAFEEAV LEVARWLHAQ TGAENLCFAG GTGLNCSTND
RLLRETPFQR VFIPPAPSDA GTSLGCAVYG LTEVGGMRCD YRWENDYLGP EPHLSDIESV
LNGADDLVVE HIEQSADLCG RIADLLADSK VVALYHGRSE FGPRALGHRS ILGDPRHGYV
RDWINARVKE REWFRPLAPV VLLEQAEEFF DIRRPSPFMQ FAAPVWPKAA DIIPAVTHVD
CTARLQTVGE QDDPFLRTLL KAFEARTGVP VVLNTSFNRK EEPIVETPAQ ALESFRRTPM
HALAMPPYLV RKRIEPEAVT PVG
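
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own pipeline: it builds the codon table from the standard NCBI ordering, which gives the same amino acid assignments as translation table 11 (table 11 differs only in its set of permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above.
first_line = "ATGGGAGATG GACCGGTAGT GCTTGGGGTT AACCGCACGC AGGATGCAAG CATCTGTCTG"
print(translate(first_line))  # MGDGPVVLGVNRTQDASICL
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows the Protein sequence block and the GC content field to be compared against the record.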