Gene Nmul_A1066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1066 
Symbol 
ID3784886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1233764 
End bp1234714 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content53% 
IMG OID637811150 
Productpeptidase S49 
Protein accessionYP_411761 
Protein GI82702195 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.609608 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGATT CACAGAATCG AAACGAAGGT TGGGACAGGC AGGTGCTGGA AAAGCTGGTG 
TTCTCCACCT TGCAGGAACA GCGCCGCACG CGGCAGTGGG GGGTGTTTTT CAAGTCCCTT
ACTTTTATCT GGCTGTTTAT CCTCCTGTTT TTCGGTCTTG GCTGGTTTGG AGACAGCAGT
ATGTCCATTT CCGGCAAGCA TACCGCTCTC GTCGATTTGC GCGGTGTAAT CTCTCCCGAT
AGCATCAGCA GCGCGGAAAA CATCACTGCC GGCCTGCAGC AGGCATTCAA GGACGCAAAA
ACGCAGGGGG TGATCCTGCG CATCAACAGT CCCGGGGGGA GTCCCGTTCA GGCAGGATAC
ATCAACGATG AGATACGCCG CCTGCGTGCA GAATATCCTG AAATACCCCT CTACGCTGTC
GTGGAAGATA TCTGCGCTTC GGGCGGCTAT TATGTAGCGG TTGCCGCCGA CAAGATATAT
GTGGACAAGG CAAGTATCAT TGGTTCCATC GGCGTCCTGA TAAACGGGTT CGGTTTTACA
AAAGCAATGG AAAAACTTGG CATCGAAAGG CGCTTGATCA CGGCAGGAGA AAACAAGGCT
TTTCTCGATC CATTTTCTCC CAACAATCGC GAGCAGGAGG AATATGCCAA GAAAATGCTG
GGTGATATCC ATGAGCAATT CATTCAGGTA GTTCAGCAAG GCCGGGGCGA ACGCCTGAAG
GAAAAGCCGG AAATATTCAG CGGCAAGGTG TGGACAGGTC AAAAAAGTGT CGAACTGGGA
CTTGCTGACG GAATGGGCAG CGCGGAATAC GTGGCGCGGG AAATTATCAA GGCGGAACAC
ATCGTCGACT ATACGACCCG GGAGGGAGTT GCCGAGCGCC TCGCCAAACG CTTTGGAGGA
GTCCTGGCGG AAACGCTGAG TGGTTTGGGA ATGAGTGCGG AACTCCACTA A
 
Protein sequence
MSDSQNRNEG WDRQVLEKLV FSTLQEQRRT RQWGVFFKSL TFIWLFILLF FGLGWFGDSS 
MSISGKHTAL VDLRGVISPD SISSAENITA GLQQAFKDAK TQGVILRINS PGGSPVQAGY
INDEIRRLRA EYPEIPLYAV VEDICASGGY YVAVAADKIY VDKASIIGSI GVLINGFGFT
KAMEKLGIER RLITAGENKA FLDPFSPNNR EQEEYAKKML GDIHEQFIQV VQQGRGERLK
EKPEIFSGKV WTGQKSVELG LADGMGSAEY VAREIIKAEH IVDYTTREGV AERLAKRFGG
VLAETLSGLG MSAELH