Gene Nmul_A1888 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1888 
Symbol 
ID3784260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2173441 
End bp2175321 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content59% 
IMG OID637811974 
Productpeptidase S8/S53 subtilisin kexin sedolisin 
Protein accessionYP_412575 
Protein GI82703009 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.67203 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATATCA ATCATAAAGC GCCTGCAACT GGCTCAAGAG AACCCGCTTC AGGCATTATT 
CAATCAAGCA CTCGCAACTC GCAGCACATC TCCAAACCTT GGTATCCCCT TTGGAGCGGC
ATACTGGCAG TTTTGCTGGG ATTGGGTCCG GTCATTTCCA GTGCAGAGCC TCCGGAGGGC
AGGGGGCCTG GAATGAACAA GCCGCTTGGC TGGGCGAAGG GTCGCATCCT GGTCATGCCG
CGTGCCGGCT TGCCGGAAAA GGAACTGGCC AAGGTTCTGG GCGAACATGG CGGCAAGGGC
AGGAAGATCG GGCAGAGCGA TCTGTACATC GTCGACTTGC CGGGTAATGC CTCTGAGAAG
GCGGTGGCGG CAAGGCTTGC GCATCATCCG GCGCTCAAGT TTGCCGAGAT CGACCAGGAA
GTCGAACCCG CGCTCATACC GAACGATCCC TACTATGGCA GCGCCTGGCA TTTGCCCAAG
ATTGGCGCTC CGTCCGCCTG GGACAGTTCC CTGGGCAGCG GCGTGACCAT CGCCATCCTG
GATTCAGGAG TTGACCCGAA TCACCCTGAT CTCTCACCCC AGCTTGTTCC AGGCTGGAAT
TTCTACGAAA ACAATTCCAA CACGTCGGAT GTTTACGGGC ATGGAACCAA GGTTGCAGGG
TCAGCGGCAG CCATGGCAAA CAATGCCCAA GGCGTTGCCG GCGTGGCGGC GTTGGCTAAA
ATAATGCCTG TCCGCGTGAG CGGGAGTACT GGATATGCCA CCTGGAGTGC GATTTCCCAA
GGGCTTATCT ATGCGGCAGA CCGCGGCGTT CGCGTCGCCA ATGCCAGCTT CCTCGGACTG
ACCGACAGTG CGAGCACCCG GAGCGCGGCG CAATACATGA AGGATAAAGG TGGCCTGGTG
ATCGTGAGTG GCGGAAATAC GGGAGTGCTC CAGAATTATG CAGTAACGAC ATCGATGGTA
GCTGTAGCCG CTACAGATGC GAACGACTTC AGAGCCAGTT GGTCGAGCTA CGGAAACTAC
ATCTCCCTTG CCGCGCCTGG AGCCGGAATT TGGACGACGA CTATGGGGGG CGGCTATGGC
GCGGCTTCCG GCACATCATT CTCAAGTCCG GTAACGGCCG GCGTGGTGGC GCTCATGATG
GCGGCAAAGC CGACATTATC CAATACCCAG ATCGAGAGTC TGCTGTTTTC AACTGCCGTG
GATCTGGGCA CAGCGGGGCG CGATCCCTAT TATGGCTATG GACGGGTTGA TGCTGCACGC
GCGGTGCAGG CGGCGGCCTC GGCAGTGGTC GCCGGAGACA CGCAGGCCCC GGCGGTTTCC
ATCACTTCCC CTGCTGGCGG ATCAAGCGTA ACAGGTCTGG TCGGCGTAAA CGTGGCTGCG
AGCGATAACG TCGGCGTGAC CCGTGTCGAG TTGCGGGTCA ACAATACCAC AGTCGCAGTC
GACACCACGG CGCCGTTCGC CTTCACCTGG GATTCGGCAG GCGTAGCCAA CGGTATGGCG
AACCTGACCG CTTATGCATT CGATGCGGCC GGCAACTCCA AGGCTTCGAC CACCGTCTCG
GTGAATGTGG CAAACGGAAC GACAACGGTG GCCAGGGATA CTACCGCGCC ACAGGTGAAG
ATCGTAAATC CGGTCACAGG CAATGTTTCG GGAAGCAACG TGCCCATCAG CGTAAATGCA
AGCGATGATA GCGGCGCTTC CGGAATTACC TATGCGCTCT ACATCGACGG CGTGCTCAAG
GCTACCGGAA AGGGAAGTAC GCTGGGTTAC AACTGGAATA TACGTAATGT TGCCGCGGGT
GCACATACTG TTCAGGTAGT CGCCAAGGAT GCGGCGGGAA ATAGCGCATC CTCATCCGTC
AGCGTGACGG TAGTCAAGTA G
 
Protein sequence
MNINHKAPAT GSREPASGII QSSTRNSQHI SKPWYPLWSG ILAVLLGLGP VISSAEPPEG 
RGPGMNKPLG WAKGRILVMP RAGLPEKELA KVLGEHGGKG RKIGQSDLYI VDLPGNASEK
AVAARLAHHP ALKFAEIDQE VEPALIPNDP YYGSAWHLPK IGAPSAWDSS LGSGVTIAIL
DSGVDPNHPD LSPQLVPGWN FYENNSNTSD VYGHGTKVAG SAAAMANNAQ GVAGVAALAK
IMPVRVSGST GYATWSAISQ GLIYAADRGV RVANASFLGL TDSASTRSAA QYMKDKGGLV
IVSGGNTGVL QNYAVTTSMV AVAATDANDF RASWSSYGNY ISLAAPGAGI WTTTMGGGYG
AASGTSFSSP VTAGVVALMM AAKPTLSNTQ IESLLFSTAV DLGTAGRDPY YGYGRVDAAR
AVQAAASAVV AGDTQAPAVS ITSPAGGSSV TGLVGVNVAA SDNVGVTRVE LRVNNTTVAV
DTTAPFAFTW DSAGVANGMA NLTAYAFDAA GNSKASTTVS VNVANGTTTV ARDTTAPQVK
IVNPVTGNVS GSNVPISVNA SDDSGASGIT YALYIDGVLK ATGKGSTLGY NWNIRNVAAG
AHTVQVVAKD AAGNSASSSV SVTVVK