Gene Nmul_A2338 details


Gene Information

Locus tag: Nmul_A2338
Symbol:
ID: 3784741
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2663193
End bp: 2665604
Gene Length: 2412 bp
Protein Length: 803 aa
Translation table: 11
GC content: 56%
IMG OID: 637812428
Product: ATP-dependent protease La
Protein accession: YP_413021
Protein GI: 82703455
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0466] ATP-dependent Lon protease, bacterial type
TIGRFAM ID: [TIGR00763] ATP-dependent protease La; [TIGR00764] lon-related putative ATP-dependent protease
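
The start, end, gene length, and protein length values in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: the function name `check_gene_record` is hypothetical, and it assumes 1-based inclusive coordinates and a single stop codon that counts toward the gene length but not the protein length. Neither assumption is stated on the page.

```python
def check_gene_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinates and lengths agree with each other.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the gene length includes one terminal stop codon that
    is not counted in the protein length.
    """
    span = end_bp - start_bp + 1                 # inclusive span in bp
    codons, remainder = divmod(gene_length_bp, 3)
    return (
        span == gene_length_bp
        and remainder == 0                       # whole number of codons
        and codons - 1 == protein_length_aa      # drop the stop codon
    )


# Values copied from the record for Nmul_A2338.
print(check_gene_record(2663193, 2665604, 2412, 803))  # prints True
```

Here 2665604 - 2663193 + 1 = 2412 bp, and 2412 / 3 = 804 codons, one of which is the stop codon, leaving 803 amino acids.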


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0304727
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCTCTC CCATGAACGA TCAAAATCAG ATTTCATTAC CTTTGTTGCC ATTGCGCGAT 
GTGGTGGTTT TTCCGCACAT GGTGATTCCC CTGTTTGTCG GCCGGCCCAA GTCCATCAAG
GCACTGGAAA TAGCGATGGA GTCCGGCAAG AGCATTTTGC TTGTGGCGCA GAAATTTGCT
GCCAAGGACG AGCCGGCTCC CGAAGATTTG TACGGGGTAT GCAGCGTTGC AAACCTGCTG
CAAATGCTCA AGCTGCCAGA CGGTACCGTG AAAGTATTGG TGGAAGGCGG TCGCCGCGCT
CGCATCGTGA AAGTGGTTGA CGACGGTACG TACTTCGCCG GCGACGCAGC ATTGCTGCCG
CCCGATGCCG TGGACAACCA TGAGGTGGAG GCGATGCGCC GCGCCATGCT GGCCCAGTTC
GACCAGTACG TGAAGCTGAA CAAGAAGATT CCCCCGGAGA TCCTGACTTC GCTCAGCGGC
ATAGACGAAG CAGGTCGTCT GGCTGATACC ATCGCCGCAC ACCTGCCCTT GAAGCTTGAG
CAGAAGCAGG AAGTGCTGGA AATTTTCGAC GTGCCGAAGC GCCTGGAACA CCTGCTTGGT
CTGCTGGAAA CCGAGCTTGA CATACTTCAG GTGGAAAAGC GTATTCGTGG TCGTGTAAAG
CGGCAGATGG AAAAAAGCCA GCGGGATTAC TATCTCAATG AGCAGGTGAA GGCCATCCAG
AAGGAATTGG GTGAAGGCGA GGAAGGCGCT GATCTGGAAG AGATGGACAA GAAGATCAAA
AACGCCCAGA TGTCCAAGGA GGCGCGTGCC AAGGCGGAAT CGGAGTTGAA AAAACTGCGG
TTGATGTCGC CCATGTCGGC AGAAGCCACT GTCGTGCGTA ATTATATTGA TGCGCTGGTG
GCCCTGCCAT GGAAGAAGAA AAGCAAGATA AGCAAAGACC TCAGCGTTGC GGAAGCCGTG
CTCGAGCAGG ATCACTACGG CCTGGAAAAA GTCAAGGAAC GGATTGTCGA GTATCTGGCG
GTACAGCAGC GCGTGGATAA GTTGAAAGCG CCCATCCTCT GCCTGGTGGG GCCGCCCGGG
GTAGGGAAGA CTTCATTGGG GCAATCGATC GCCCGGGCTA CCAACCGGAA GTTCGTCCGC
ATGTCGCTCG GCGGCGTCAG GGATGAAGCA GAGGTACGTG GTCATCGCCG CACCTATATC
GGTTCCATGC CTGGCAAGAT TCTGCAGAAC ATGGCGAAAG TGGGTGTGAA GAACCCCTTG
TTCCTGCTGG ACGAGGTGGA CAAGATGGGC ATGGATTTTC GGGGTGATCC GTCCTCTGCG
CTCCTGGAAG TGCTGGATCC CGAGCAGAAT AATTCGTTCG TCGATCACTA TGTCGAGGTT
GAGTACGATC TGTCGGACGT CATGTTCGTC GCGACTGCGA ATACGTTGAA CATCCCTGCG
CCGCTGCTGG ATCGCATGGA AGTGATCCGG CTGTCCGGCT ATACCGAGGA TGAAAAACTC
AACATCGCGA CACGCTATCT GTTGCCGAAA CAGATGAAAA ATCACGGTTT GAAGGAAAAT
GAACTGACCG TCTCGGAGTC AGCCCTGCGG GATATCACGC GCTATTACAC CCGTGAAGCG
GGCGTGCGGG CGATGGAGCG GGAAATTTCC AAAATATGCC GTAAAGTGGT CAAGGCGTTG
CTGCTGAAAG GCGGGCAGAA ACGGATTACC GTCACCGGGA GGAACCTGGA CAAATATCTC
GGTGTAAGGC GCTATACCTA CGGTGTCGCG GAGGAAAAGA ACCAGATCGG CCAAGTGACG
GGCCTCGCCT GGACCGAGGT CGGGGGGGAA TTGCTGACGA TCGAGGCCGT CGTATTGCCG
GGTAAGGGTA AATCCATCAC GACCGGCAAA CTGGGCGAGG TCATGCAGGA ATCTGTCCAG
GCTGCCCTGT CGGTGGTTCG CAGCCGCTCA AGGGCTCTGG GTATCGCGGA TGATTTTTAC
CAGAAGAACG ACATCCATAT CCATCTGCCG GAGGGTGCGA CCCCGAAAGA CGGTCCCAGT
GCCGGTATAG GTATCTGCGT GGCGATGGTG TCGGCGTTGA CCAACATTCC GGCCCGCGCA
ACTGTCGCGA TGACCGGTGA GATCACACTT CGCGGTGAGG TGCTGGCGAT TGGCGGACTC
AAGGAAAAAC TGCTCGCCGC GCATCGCGGC GGCATAAAGA CTGTGCTGAT TCCCGAGGAT
AACGTTAAGG ATCTGAACGA AATCCCGGAG AATATCAAAA ACAAGCTGGA TATTCATCCG
GTCAAATGGA TAGATCAGGT GCTGGATCTG GCCCTGGAAT CCAAACCGGA ACCGCTTCCG
GCCGCTCCTT CATCCGTTCC CTCTCCTGTT GCGGTGGAAG GCGATGTGAC GCCGGCGGTC
ATCAAGCACT AG
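
The gene sequence is printed in blocks of ten bases, so it has to be joined into one string before it can be measured. The sketch below shows one way to do that and to compute a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as sample input, so the printed value is for that fragment and not the 56% reported for the whole gene.

```python
def clean_sequence(text):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a cleaned sequence (0.0 if empty)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence only, used as sample input.
first_line = "ATGTCCTCTC CCATGAACGA TCAAAATCAG ATTTCATTAC CTTTGTTGCC ATTGCGCGAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence text to the same two functions should give a length that matches the gene length in the record, assuming the blocks were copied without loss.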
 
Protein sequence
MSSPMNDQNQ ISLPLLPLRD VVVFPHMVIP LFVGRPKSIK ALEIAMESGK SILLVAQKFA 
AKDEPAPEDL YGVCSVANLL QMLKLPDGTV KVLVEGGRRA RIVKVVDDGT YFAGDAALLP
PDAVDNHEVE AMRRAMLAQF DQYVKLNKKI PPEILTSLSG IDEAGRLADT IAAHLPLKLE
QKQEVLEIFD VPKRLEHLLG LLETELDILQ VEKRIRGRVK RQMEKSQRDY YLNEQVKAIQ
KELGEGEEGA DLEEMDKKIK NAQMSKEARA KAESELKKLR LMSPMSAEAT VVRNYIDALV
ALPWKKKSKI SKDLSVAEAV LEQDHYGLEK VKERIVEYLA VQQRVDKLKA PILCLVGPPG
VGKTSLGQSI ARATNRKFVR MSLGGVRDEA EVRGHRRTYI GSMPGKILQN MAKVGVKNPL
FLLDEVDKMG MDFRGDPSSA LLEVLDPEQN NSFVDHYVEV EYDLSDVMFV ATANTLNIPA
PLLDRMEVIR LSGYTEDEKL NIATRYLLPK QMKNHGLKEN ELTVSESALR DITRYYTREA
GVRAMEREIS KICRKVVKAL LLKGGQKRIT VTGRNLDKYL GVRRYTYGVA EEKNQIGQVT
GLAWTEVGGE LLTIEAVVLP GKGKSITTGK LGEVMQESVQ AALSVVRSRS RALGIADDFY
QKNDIHIHLP EGATPKDGPS AGIGICVAMV SALTNIPARA TVAMTGEITL RGEVLAIGGL
KEKLLAAHRG GIKTVLIPED NVKDLNEIPE NIKNKLDIHP VKWIDQVLDL ALESKPEPLP
AAPSSVPSPV AVEGDVTPAV IKH
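
The protein sequence can be compared against a translation of the gene sequence. The sketch below is a minimal, dependency-free translator, and the names `CODON_TABLE` and `translate` are hypothetical. It uses the standard amino acid assignments, which translation table 11 shares; table 11 differs in which start codons it permits, and that is ignored here. It also assumes the input holds only A, C, G, and T, and raises `KeyError` otherwise.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# which translation table 11 also uses); "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(text):
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence from this record.
print(translate("ATGTCCTCTC CCATGAACGA TCAAAATCAG ATTTCATTAC CTTTGTTGCC ATTGCGCGAT"))
print(translate("ATCAAGCACT AG"))
```

The two fragments translate to `MSSPMNDQNQISLPLLPLRD` and `IKH`, which match the start and end of the protein sequence listed above.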