Gene Nmul_A1119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1119 
Symbol 
ID3785699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1290142 
End bp1291587 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content54% 
IMG OID637811204 
Productcarboxyl-terminal protease 
Protein accessionYP_411814 
Protein GI82702248 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGACA AAATGCGGCA GTTCGGCCTC GTCATTTTCG GTGCAATCGC CGGCGTGATG 
CTGAGCCTCA ATTTTTCTGC TGTTGCCAAT AAGGAACCGC AGGGGGTCCT GCATCCCCTT
CCGGTGGAGG AGCTTCGGGC CTTCACAGAA GTGTTCGGCC GGATAAAAAA TGATTACGTC
GAACCGGTAG AAGACAAAAA GCTCATCACT GAAGCCATCA ATGGCATGCT GACAGGCCTC
GATCCCCATT CCGCCTATCT GGATGCAGAT GCTTTCAAGG AGCTGCAGAT TGGCACTCAG
GGTGAGTTCG GGGGCCTGGG CATCGAAGTG AGCATGGAAG ACGGATTTGT CAAGGTCATT
TCTCCTATCG AGGATACGCC CGCCTTCCGC GCCGGGATAA AACCCGGAGA TCTTATCATC
AAGCTGGACG ATACCGCGGT CAAGGGGCTC TCGCTTACGG AGGCCATCAA GCGCATGCGC
GGCAAGCCCG ATACGCCTAT TACCCTTACC GTCGTGCGCA AGGGCGAGGC CAAGCCGATT
GTGTTCCCAC TGGTTCGCGC CGTCATCAGG ATACAAAGCG TGAAGTCGAA AATGATCGAG
CCTGGCTACG GATATATTCG TATCACCCAG TTTCAGGAAC AGACGGGGGA AAACCTGGCG
AAGGCGATAG ATAAGCTTTT CAAGGAAAGC GGTGGCTCGA TGAAGGGACT GGTGCTGGAC
TTGCGTAATG ACCCGGGGGG CCTGTTGAAT GGAGCAGTGG CAGTATCTGC GGCCTTTTTA
CCGGAGGATT CTCTGGTTGT CTATACCGAT GGCCGTAGCG AAGATGCGAA GATGAAACTC
AAGGCCAGTC CCGAGTTCTA TCTGCGCGAT ACGAAGAACG ACTACGTCAA GCGGCTCCCA
GCGGGTATCA AGACCGTACC GATGGTGACA CTCGTAAATG GCGGCTCGGC TTCGGCTTCG
GAAATCGTTG CCGGCGCGCT GCAGGATCAT AAACGGTCGA TCGTGATGGG AACGCAAACC
TTCGGTAAAG GTTCGGTGCA AACCATTTTA CCGCTGGGAA ACAACACAGC TATCAAATTG
ACCACGGCAC GGTACTATAC GCCAAATGGA CAATCCATCC AGGCCAAGGG AATCACGCCT
GATGTCATGG ATGAATTGGC AAAGGATGAA GTCGAGCGCC TGCGTGAAGC TGATCTCGAT
CGTCATCTAT CCAATGGCAA GGCTGAGGAC CAGAAACGGG AGACGGAGGC GAAGGGAGTG
CCGGAAACAA AGGTCGCGCC AAAACCGGGA GTCAAGCCGG TCAAGTCCGA GAACGAGGAA
GACAAGAACA AGAAGCGTGA GTCACCTGCT GAGTTCGGTT CCAGCGATGA CCTGCTGCTG
GTTCAGGCTA TCAGCTATAT CAAGGAAAAC GCCAGCAAGC GCGCAACTGC AAAAGTAGAA
AACTGA
 
Protein sequence
MGDKMRQFGL VIFGAIAGVM LSLNFSAVAN KEPQGVLHPL PVEELRAFTE VFGRIKNDYV 
EPVEDKKLIT EAINGMLTGL DPHSAYLDAD AFKELQIGTQ GEFGGLGIEV SMEDGFVKVI
SPIEDTPAFR AGIKPGDLII KLDDTAVKGL SLTEAIKRMR GKPDTPITLT VVRKGEAKPI
VFPLVRAVIR IQSVKSKMIE PGYGYIRITQ FQEQTGENLA KAIDKLFKES GGSMKGLVLD
LRNDPGGLLN GAVAVSAAFL PEDSLVVYTD GRSEDAKMKL KASPEFYLRD TKNDYVKRLP
AGIKTVPMVT LVNGGSASAS EIVAGALQDH KRSIVMGTQT FGKGSVQTIL PLGNNTAIKL
TTARYYTPNG QSIQAKGITP DVMDELAKDE VERLREADLD RHLSNGKAED QKRETEAKGV
PETKVAPKPG VKPVKSENEE DKNKKRESPA EFGSSDDLLL VQAISYIKEN ASKRATAKVE
N