Gene Nmul_A0216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0216 
Symbol 
ID3784593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp230469 
End bp231764 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content54% 
IMG OID637810288 
Productglutamate--cysteine ligase GshA 
Protein accessionYP_410916 
Protein GI82701350 
COG category 
COG ID 
TIGRFAM ID[TIGR02049] glutamate--cysteine ligase, T. ferrooxidans family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTACGC CCCACCTTGA TACTGCCTTG AGCGGCCCCA TTCTTGATCT TGAAAGATGC 
ATCCTCAATG CCATGCCTTC GATCGAGCAA TGGTTGCGCA ATGAATGGCA GACGCATGCT
GCGCCTTTTT ACTGCTCCGT GGATTTGCGC AATAGCGGGT TCAAGCTGGC GCCCGTCGAT
ACCAACCTGT TTCCAGGCGG CTTCAATAAT CTAAACCGCG AATTCATGCC CTTGTGCGTC
CAGGCGATGA TGGCGGCAAT CGATAAAATC TGTACCGGCG CGCGCAGCGT ACTGCTGATT
CCGGAAAGCC ACACGCGCAA TATTTTTTAT TTGCAAAATC TGGCGGCCTT GCACGCAATC
ATGCGGCATG CCGGCATTCA TGTCCGTATC GGAACGCTGC TCCCTGAGAT CACCGCACCC
ACTGCGCTCG ACCTGCCCGG TGGAAACAAG CTCACGCTCG AACCCATTCA GCGGAAGGGC
AATCGGGTCG TTCTGGAAGA TTTCGATCCT TGCGTCGTGC TTCTCAATAA CGACCTTTCC
ACCGGCACCC CGGCCGTCCT GCAGAATCTC GAGCAGACGG TCATTCCTCC CCTGCATGCG
GGATGGACCA GCCGGCGGAA ATCCCATCAT TTCACTGCCT ATGATAACGT TTCGCAGCAG
TTTGCGAGCC TGATCGGTAT TGACCCGTGG CTTATCAATC CTTATTTCGC TTCCTGCGGA
AAAATCAATT TTCGCGAGAA AAAGGGTGAA GACTGCGTGG CCAATACGGT GGATGAAATC
CTGCACCAGA TCCGGGAGAA ATATGCCGAG TATGGAGTCA GGAAAGATCC CTTCGTGATC
GTAAAGGCCG ACGCCGGTAC GTACGGAATG GGGATAATGA CTGTAAAAGA TGGCGCGCAA
GTACGTACAC TCAGCCGGAA ACAGCGCAAC AAGATGGCGG TCGTAAAAGA AGGATTGGAG
GTGACCGACA TCATGGTGCA GGAAGGGGTT TATACGTTTG AGAATGTCGA CGATGCGGTG
GCGGAGCCCG TCATCTATAT GATCGATCGC TATGTCGTCG GTGGTTTCTA CCGGGTGCAT
ACCGAGCGGG GCGTTGACGA GAATCTCAAC GCCCCCGGTA TGCATTTTGT ACCCCTGGCA
TTCGAAGATA CCTGTCTGCT ACCGGATCGG GAAGCGCAGC CGGGTTGCAG CGCCAACCGG
TTTTATGCTT ATGGCGTCAT AGCCAGGCTC GCTTTACTGG CGGCTGCACA AGAACTGGAA
AAAAGTGAGG CGAGAATCGA AGCGATCATG GCTTAG
 
Protein sequence
MPTPHLDTAL SGPILDLERC ILNAMPSIEQ WLRNEWQTHA APFYCSVDLR NSGFKLAPVD 
TNLFPGGFNN LNREFMPLCV QAMMAAIDKI CTGARSVLLI PESHTRNIFY LQNLAALHAI
MRHAGIHVRI GTLLPEITAP TALDLPGGNK LTLEPIQRKG NRVVLEDFDP CVVLLNNDLS
TGTPAVLQNL EQTVIPPLHA GWTSRRKSHH FTAYDNVSQQ FASLIGIDPW LINPYFASCG
KINFREKKGE DCVANTVDEI LHQIREKYAE YGVRKDPFVI VKADAGTYGM GIMTVKDGAQ
VRTLSRKQRN KMAVVKEGLE VTDIMVQEGV YTFENVDDAV AEPVIYMIDR YVVGGFYRVH
TERGVDENLN APGMHFVPLA FEDTCLLPDR EAQPGCSANR FYAYGVIARL ALLAAAQELE
KSEARIEAIM A