Gene Nmul_A0510 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0510 
Symbol 
ID3785639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp576365 
End bp577774 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content59% 
IMG OID637810592 
Productpeptidase S1C, Do 
Protein accessionYP_411210 
Protein GI82701644 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.727418 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAAC CGTCCGGATC AGTTTCGCAG CCAGCTCCAC TGTCAGTTTC ACCGCTGTCA 
GTTTCACTCT GGCTGGTTTT TATCGGCTTG CTGGTCATTG CGCTGCCGGG CTATCCGGCT
TTGCCTTTCA GTGACACCAA AAATGGTGTG CCGACCCTGG CGCCGGTGCT TCAGGACGTG
ACCCCGGCGG TCGTCAATAT TTCCGTAGTG ACGCGCTCGG CCATAGAAGA GAATCCCTTG
TTTCGGGACC CTTTCTTCCG CCGCTTTTTC AATCTTCCGG AGCAGGAGGC CCGGCCGGAG
CGCAGTGTGG GTTCCGGCGT GATCATGGAT GCGGCCAAGG GTTATGTGGT GACCAACTAC
CACGTGATCA AGGATGCGCA GCAGGTGGTT GTGACACTCA AAGACAGGCG CCAATTCCAG
GCGAAGCTGG TGGGGACGGA CCCGGGTACC GATATTGCCC TGCTGAAAAT CGATGCCAAG
AACCTTAAGG CGCTGCGCCT CGGTGATTCC GATCTGCTCA ACGTCGGGGA TTTTGTCATC
GCGATCGGGA ATCCGTTCGG TCTGGGGCAA ACGGTAACAT CCGGCATCGT CAGCGCATTG
GGAAGAAGCG GACTGGATAT CGAAGGCTAT GAAGACTTCA TCCAGACCGA CGCCTCCATC
AATCCGGGCA ACTCCGGCGG AGCCCTCATC AACCTCAAGG GGGAACTGAT CGGCATCAAC
ACCGCCATCA TTGGTCCCGC AGGCGGCAAC GTCGGCATTG GTTTTGCGGT TCCCAGTGTG
ATGGTGAAGG CAGTCCTCGA TCAAATCCTG CGCTTCGGCG AGGTACGCCG CGGCAGACTG
GGAGCGAGCA GCGAAGATAT TACGCACGAC CTGGCGGCCT CACTGGGGTT GCCTTCCACG
GAAGGCGCCA TCATCAGTGC GGTCGAGCCG GGATCGCCTG CGGAAAAAGC CGGGGTGAAA
CCGCGTGATG TGATAACGAC TGTCAACGGA AAACCTGTGC GGAACTCGAT TGATTTGCGC
AACAAGGTTG GACTGATGCC CATCGGGGAG ACGCTGGATC TCGGGCTGAT CAGGAATGGG
AAGCGGCTGA CTGCGAAAGT AAAGATAGCC AAGCTGGCGG AGGCTTCCAG TGCAGGTGCG
GAGGCCGTGC CGGAGGTCTC AGGCGCAACC GTGGCGGACT TGAAACCCGG CGCCCGCCGG
GGAATTGAGG GCGTGATGGT AACAGACGTG GAAGTCAATA GTCCCGCCTG GCTGCGGGGC
ATCCGGCCGG GGGACCTCAT CATTGGCGTC AATCGCCGCC AGGTCCGTTC GGTGCAGGAG
TTGCTTGCTG CTCTCAAGGC CAGCAGCCAG GGACGGCTCA CCCTAAGCCT GATACGGGGC
GACTTCAGGG TGACTATCAG GATCAGGTAA
 
Protein sequence
MRKPSGSVSQ PAPLSVSPLS VSLWLVFIGL LVIALPGYPA LPFSDTKNGV PTLAPVLQDV 
TPAVVNISVV TRSAIEENPL FRDPFFRRFF NLPEQEARPE RSVGSGVIMD AAKGYVVTNY
HVIKDAQQVV VTLKDRRQFQ AKLVGTDPGT DIALLKIDAK NLKALRLGDS DLLNVGDFVI
AIGNPFGLGQ TVTSGIVSAL GRSGLDIEGY EDFIQTDASI NPGNSGGALI NLKGELIGIN
TAIIGPAGGN VGIGFAVPSV MVKAVLDQIL RFGEVRRGRL GASSEDITHD LAASLGLPST
EGAIISAVEP GSPAEKAGVK PRDVITTVNG KPVRNSIDLR NKVGLMPIGE TLDLGLIRNG
KRLTAKVKIA KLAEASSAGA EAVPEVSGAT VADLKPGARR GIEGVMVTDV EVNSPAWLRG
IRPGDLIIGV NRRQVRSVQE LLAALKASSQ GRLTLSLIRG DFRVTIRIR