Gene Nmul_A1043 details

Gene Information

Locus tag: Nmul_A1043
Symbol:
ID: 3785170
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1206513
End bp: 1207736
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 55%
IMG OID: 637811127
Product: argininosuccinate synthase
Protein accession: YP_411738
Protein GI: 82702172
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0137] Argininosuccinate synthase
TIGRFAM ID: [TIGR00032] argininosuccinate synthase
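
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; both assumptions are consistent with the values listed. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1206513
END_BP = 1207736
GENE_LENGTH_BP = 1224
PROTEIN_LENGTH_AA = 407


def span_length(start: int, end: int) -> int:
    """Length of a feature with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```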


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAATGTCG CTCAGATTAA AAAAGTAGTC CTGGCATTTT CCGGCGGTCT GGATACGTCG 
GTAATCCTGA AGTGGCTGCA GGATACCTAC CGGTGCGAAG TTGTCACCTT CACCGCTGAC
ATTGGCCAGG GGGAGGAGGT GGAACCTGCC CGTGCAAAGG CGAAGCAGCT TGGCGTGAGG
GAAATCTTCA TCGACGACTT GCGCGAAGAA TTCGTACGCG ATTTCGTCTT TCCCATGTTT
CGGGCCAATA CCCTTTATGA AGGGGAGTAC CTGCTCGGCA CCAGCATCGC ACGCCCGCTG
ATTGCCAAGC GCCAGATCGA AATCGCGCGA GAAACCGGTG CGGACGCCGT GTCGCACGGC
GCGACCGGGA AAGGCAATGA TCAGGTGCGC TTTGAACTCG GTTACTATGC GCTTCAACCG
GATATCCGGG TGATTGCGCC GTGGCGCGAA TGGGACCTTA CATCCCGGGA AAAGCTTCTG
AAATACGCGG AGCAGCACGG CATCCCGGTG GAAATGAAGA AAAAGGAAGG CTCTCCCTAC
AGTATGGATG CCAACCTTCT GCATATTTCG TATGAGGGAC GTATTCTCGA AGACCCTGCG
CAGGAGCCGG AAGAGTCGAT GTGGCGGTGG AGTGTTTCGC CGGAAAAAGC TCCGGATAGT
CCCGAATATC TCGACCTCGA ATTCAGGCAG GGCGATATCG TTGCATTGGA TGGGGAAGAG
CTTTCGCCGG CGCGCCTGCT CGCCAGACTC AATGAACTGG GGGGCAAGCA CGGGATAGGG
CGCCTGGATC TGGTCGAAAA CCGGTATGTC GGTATGAAGT CCCGCGGGTG CTACGAGACT
CCCGGCGGCA CGATCATGCT GCGTGCCCAC CGGGCCATGG AGTCCATTAC ACTGGACCGT
GAAGTCGCAC ATTTGAAAGA CGAGCTTATG CCACGCTATG CCGAACTGAT TTACAACGGC
TATTGGTGGA GTCCCGAACG CAGAATGATG CAGACGATGA TCGATGCGTC ACAGGCGCAT
GTGAACGGCT GGGTGCGGGT GAAGCTCTAC AAGGGTAATG TGATTGTCGT GGGGAGAGAC
TCGAAAACAG ATTCGCTGTT TGATCCCCAT ATTGCAACCT TCGAGGATGA CCAGGGTGCG
TACAATCAGA TGGATGCTGC CGGTTTCATC AAGTTGAATG CACTCAGGAT GCGGATTGCC
GCCAATTTAA GAAATCGCAA ATAA
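
The GC content field can be recomputed from a sequence listing such as the one above. This is a minimal sketch: it assumes the listing is plain text in blocks of ten bases separated by whitespace, and the helper names `clean_sequence` and `gc_percent` are hypothetical. Only the first line of the listing is used in the demo, so the printed value describes that line, not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing; paste the full listing to
# compare the result with the GC content field of the record.
first_line = "GTGAATGTCG CTCAGATTAA AAAAGTAGTC CTGGCATTTT CCGGCGGTCT GGATACGTCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```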
 
Protein sequence
MNVAQIKKVV LAFSGGLDTS VILKWLQDTY RCEVVTFTAD IGQGEEVEPA RAKAKQLGVR 
EIFIDDLREE FVRDFVFPMF RANTLYEGEY LLGTSIARPL IAKRQIEIAR ETGADAVSHG
ATGKGNDQVR FELGYYALQP DIRVIAPWRE WDLTSREKLL KYAEQHGIPV EMKKKEGSPY
SMDANLLHIS YEGRILEDPA QEPEESMWRW SVSPEKAPDS PEYLDLEFRQ GDIVALDGEE
LSPARLLARL NELGGKHGIG RLDLVENRYV GMKSRGCYET PGGTIMLRAH RAMESITLDR
EVAHLKDELM PRYAELIYNG YWWSPERRMM QTMIDASQAH VNGWVRVKLY KGNVIVVGRD
SKTDSLFDPH IATFEDDQGA YNQMDAAGFI KLNALRMRIA ANLRNRK
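
The protein sequence follows from the gene sequence under translation table 11 (the NCBI bacterial, archaeal and plant plastid code). The sketch below shows why the gene begins with GTG while the protein begins with M: GTG normally encodes valine, but as an initiation codon it is read as methionine. The function name `translate_cds` is hypothetical, and the code assumes an unspliced CDS given in the coding orientation, as in the listing above.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). The amino-acid
# assignments of table 11 are the same as those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(text: str) -> str:
    """Translate a CDS, reading a valid first codon as Met and stopping at a stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        aa = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is read as methionine
        if aa == "*":
            break  # stop codon ends the protein
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate_cds("GTGAATGTCG CTCAGATTAA AAAAGTAGTC"))
```

Passing the full gene sequence to `translate_cds` and comparing the result with the protein sequence listing (after removing whitespace) checks the two listings against each other.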