Gene Nmul_A1723 details


Gene Information

Locus tag: Nmul_A1723
Symbol:
ID: 3786200
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1967093
End bp: 1968073
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 55%
IMG OID: 637811810
Product: putative glutathione S-transferase
Protein accession: YP_412413
Protein GI: 82702847
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0435] Predicted glutathione S-transferase
TIGRFAM ID:
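
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the database record: the function name `check_cds_lengths` is hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if coordinates, gene length and protein length agree.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon.
    """
    span = end_bp - start_bp + 1                 # inclusive coordinate span
    codons, remainder = divmod(gene_length_bp, 3)
    return (
        span == gene_length_bp                   # coordinates match gene length
        and remainder == 0                       # whole number of codons
        and codons - 1 == protein_length_aa      # drop the stop codon
    )


# Values taken from the Gene Information fields above.
print(check_cds_lengths(1967093, 1968073, 981, 326))  # prints True
```

For this gene, 1968073 - 1967093 + 1 = 981 bp, and 981 / 3 = 327 codons, which is 326 amino acids plus one stop codon.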


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.959435
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTTTAC TCGTTAAAGG CAAGTGGGTG GACGAGTGGT ACGATACCAG ATCGACCGGC 
GGCCGGTTCA TCCGTACCAA TGCGCAGTTC CGCAACTGGA TAACGGCCGA TGGCAGTCCG
GGGCCCACCG GCGAAGGCGG GTTTCCCGCC GAGGCGGGGC GTTATCATCT ATATGTCTCG
CTTGCTTGCC CCTGGGCTTC CCGCACGCTG ATTTTTCGGA TGCTCAAAGG GCTTGAGAGC
ATGATCAGTG TTTCGGTGGT GCATCCCTAC ATGGGCGAGC ATGGCTGGAC TTTTGATGAG
GCGCCGGGAG TAATACCTGA TCCCGTGGGT GGCGCATCCT ATCTTTATGA AGTCTACCTC
CGGTCGGTGC CTGACTATAG TGGACGCGTG ACAGTACCCG TGCTCTGGGA TTTGCAGCGG
AATACCATTG TCAGCAATGA ATCGGCCGAT ATCATCCGCA TGATGAACTC GGCTTTCGAT
GGAATAGGCG CTTTGCCCGG GAATTATGCG CCTGAGGTAT TGCTTCCACA GATCGCCGAG
ATCAATGCGC GCATTTACGC TGACGTCAAT AATGGCGTTT ACAAGGCAGG TTTTGCCACT
AGGCAATCGG TATATGAGAA GGCGGTGATG GTGCTGTTCA GATGCATGGA CGAGCTGGAA
CAACTGCTTT CACGTCAGCG TTATCTCATC GGCAACTGTA TCACTGAAGC CGATTGGCGG
ATATTCACCA CGCTGATCCG CTTTGATCCG GTCTATCACG GCCATTTCAA GTGCAACCTC
AGGCGTCTCG TGGATTATCC CAATCTCTGG GCCTACACAC GGGAGTTGTA TCAATGGCCG
GGTGTGGCAG AGACTGTGAA CATGCAGCAC ATCAAGGAGC ACTATTACCG CAGTCATCCC
ACCATCAATC CGAATCGCAT TGTGCCGGTG GGCCCGATCC TGAATCTCGA TGAGCCCCAT
GATCGCACGA AGCTGGCATA G
 
Protein sequence
MGLLVKGKWV DEWYDTRSTG GRFIRTNAQF RNWITADGSP GPTGEGGFPA EAGRYHLYVS 
LACPWASRTL IFRMLKGLES MISVSVVHPY MGEHGWTFDE APGVIPDPVG GASYLYEVYL
RSVPDYSGRV TVPVLWDLQR NTIVSNESAD IIRMMNSAFD GIGALPGNYA PEVLLPQIAE
INARIYADVN NGVYKAGFAT RQSVYEKAVM VLFRCMDELE QLLSRQRYLI GNCITEADWR
IFTTLIRFDP VYHGHFKCNL RRLVDYPNLW AYTRELYQWP GVAETVNMQH IKEHYYRSHP
TINPNRIVPV GPILNLDEPH DRTKLA
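
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step using only the Python standard library; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of the database.

```python
def clean_sequence(text):
    """Remove all whitespace (block spaces, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a DNA string (0.0 if empty)."""
    seq = clean_sequence(dna)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above: six blocks of ten bases.
first_line = "ATGGGTTTAC TCGTTAAAGG CAAGTGGGTG GACGAGTGGT ACGATACCAG ATCGACCGGC"
print(len(clean_sequence(first_line)))  # prints 60
```

Passing all gene sequence lines to `clean_sequence` gives a string whose length can be compared with the reported gene length of 981 bp, and `gc_fraction` on the same input can be compared with the reported GC content of 55%.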