Gene Nmul_A2097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2097 
Symbol 
ID3784668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2388014 
End bp2390386 
Gene Length2373 bp 
Protein Length790 aa 
Translation table11 
GC content58% 
IMG OID637812185 
ProductATP-dependent protease La 
Protein accessionYP_412782 
Protein GI82703216 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGAGC TTATCGTAGC CAAGCCGGAA TTACCTCAGG ACGTGATTGC CCTCATACCC 
ATGCGCAATA TAGTGCTATT CCCGCATGTT CTGACTGCAA TCACCGTGGG TCGTGCCAAA
TCCATTGCTG CGCTGGAACA TGCGCTCGAT CCTAAAAGGC CCCTAGGCAT CATTCTGCAG
AAAGATCCTG CTGTGGATGA GCCGGGGCAG GATGCGCTGT TTAATGTAGG TACCGTGGTG
AACGTTGTGC GGCATCTTGC CTCATCCGAC GGGCTGCGGC ATGCCGTGTG TCAGGGGTTG
GGGCGCTTTT CCATCGAGGA AATGATCGAG GATCGTCCTT TTCTCGCTGC CCGTGTGCGA
CTTATTGCGG AGCCTGATGA GGTATCGACG GAAGCTGAAG CCCTGGCGAT GCAATTGCGG
GAGCGCACGG TGGAGATTCT TTCCCTGCTG CCCGGCGTAC CGGCAGAACT GGCGCATGCG
CTACAAGCGA CCCGCGCCCC TTCGCACCTG GCTGATATCG CGGCAAGCCT GCTCGATACC
GAAGTTGCGG AGAAGCAGAT GCTGCTTGAA ACGGTCAGTA CCGAGGAGCG GCTGCGAAAG
GTGCTGCAGA TTTTGTCGCG TCGTATCGAG GTACTGAGAT TATCCCAGGA GATTGGTGAG
CGTACGAAGG AGCATCTGGA AGACCGTGAA CGCAAGTTTC TGTTGCGGGA GCAATTGAAG
ACGATCCAGA AAGAACTCGG CGAGACGGAG GGGGACGACC AGGAAATAGA GAAGCTGGAT
GAAGCGGTGG CCAAGGCTGG CATGCCGGAA GAGATCGAGG CGCAGGCCAG AAAGGAATTG
CAGCGCTTAA AGCGCATGCC TCCGGCTTCA AGCGAGTATT CGATGCTGCA TACCTACCTC
GAATGGATGA CAGAGTTGCC CTGGAAGCTC CCGGAAGATG CACCTATTGA CCTCGATGCC
GCACGCCGAA TTCTCGAACA CGATCATTTC GGGCTGGAGC GCGTCAAACA GCGCATTATT
GAATTCCTGG CCGTTCAGAA GTTGAAGCCC CAAGGGCGTG CTCCTATCCT GTGTTTTGTC
GGACCACCGG GGGTAGGCAA GACTTCGCTG GGCCAGAGCA TAGCACGGGC ATTGCAGCGG
CCCTTCGTGC GCGTATCATT GGGTGGCGTG CATGACGAGG CGGAAATGCG AGGCCATCGT
CGCACCTACA TTGGCGCCAT GCCGGGTAAC ATTGTGCAGA GCCTGCGCAA GGCTGGCGCT
CGAAACTGCG TGATGATGCT CGATGAGGTG GATAAGATGT CAGCCAGCCT CCACGGCGAC
CCATCGGCAG CTCTGCTAGA GGTGCTGGAT CCCGAGCAGA ATTCTACGTT CCGGGACAAT
TACCTGGGGG TGCCCTTCGA TTTGAGCCGT GTTGTTTTCA TCGCCACCGC GAATGTTATT
GACAACGTGC CGCCTCCAGT GCGCGACCGG ATGGAGATTA TTGATCTCCC GGGCTACACC
CGGGAGGAAA AGCTTCAGAT CGCCCAGCGT TACCTCGTGG GGCGTCAACG CGAGGTAAAC
GGCCTCAGCG AGGATCAGTG CGAAATATCG GTGGAAGCGC TCGATGGCAT CATCGCCAAT
TACACCCGTG AGGCGGGAGT GCGGCAACTC GAACGGGAGA TCGGGCGCGT CATGCGGCAT
GCAGCCATGC GTGTTGCGTC GGATGCAGAG GCTAAGGTAC GCGTGGATGC CGCAGATCTG
GATACTATCC TCGGCCCTGC CAAATTCGAG CACGAGACCG GGCTGCTCAC CAGTTTGCCA
GGTGTTGCAA CGGGACTTGC CTGGACACCC GTAGGCGGTG ACATTCTTTT TATCGAGGCA
ACACGGGTGA GCGGACGTGG GCAGCTTATC CTAACCGGGC AGCTTGGCGG CGTGATGAAG
GAAAGTGCGC AGGCGGCGCT TACGTTGTTG AAAGGCCGGG CAGACAGTCT TCACATCCCC
GCATCCGTAT TTGAAGGCAT CGACGTGCAC GTGCATGTAC CAGCAGGGGC TATTCCTAAA
GATGGTCCCA GTGCAGGGGT GGCGATGTTC ATAGCGCTTT CTTCGCTCTT TACCAACCGC
CCGGTACACC GGGATGTGGC GATGACGGGA GAGATCAGCC TGCGAGGGAT GGTGCTGCCG
GTGGGAGGTA TCAAGGAGAA GGTGCTTGCA GCCCAGCGGG CGGGCTTACG CGTGGTGCTG
CTGCCCGCGC GCAACGAGAA AGACCTGCGC GAGGTGCCGG AAACAACGCG TTCCACCTTG
GAGTTCGTTT TTCTGGAAAC AGTGGACGAT GCGATTCAGG CATCGCTGGG TCAGCGGGCG
CGACGTCAGG AGTCGGAGTT CAAGCTTGTC TGA
 
Protein sequence
MGELIVAKPE LPQDVIALIP MRNIVLFPHV LTAITVGRAK SIAALEHALD PKRPLGIILQ 
KDPAVDEPGQ DALFNVGTVV NVVRHLASSD GLRHAVCQGL GRFSIEEMIE DRPFLAARVR
LIAEPDEVST EAEALAMQLR ERTVEILSLL PGVPAELAHA LQATRAPSHL ADIAASLLDT
EVAEKQMLLE TVSTEERLRK VLQILSRRIE VLRLSQEIGE RTKEHLEDRE RKFLLREQLK
TIQKELGETE GDDQEIEKLD EAVAKAGMPE EIEAQARKEL QRLKRMPPAS SEYSMLHTYL
EWMTELPWKL PEDAPIDLDA ARRILEHDHF GLERVKQRII EFLAVQKLKP QGRAPILCFV
GPPGVGKTSL GQSIARALQR PFVRVSLGGV HDEAEMRGHR RTYIGAMPGN IVQSLRKAGA
RNCVMMLDEV DKMSASLHGD PSAALLEVLD PEQNSTFRDN YLGVPFDLSR VVFIATANVI
DNVPPPVRDR MEIIDLPGYT REEKLQIAQR YLVGRQREVN GLSEDQCEIS VEALDGIIAN
YTREAGVRQL EREIGRVMRH AAMRVASDAE AKVRVDAADL DTILGPAKFE HETGLLTSLP
GVATGLAWTP VGGDILFIEA TRVSGRGQLI LTGQLGGVMK ESAQAALTLL KGRADSLHIP
ASVFEGIDVH VHVPAGAIPK DGPSAGVAMF IALSSLFTNR PVHRDVAMTG EISLRGMVLP
VGGIKEKVLA AQRAGLRVVL LPARNEKDLR EVPETTRSTL EFVFLETVDD AIQASLGQRA
RRQESEFKLV