Gene Nmul_A2336 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2336 
Symbol 
ID3785326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2660638 
End bp2662518 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content54% 
IMG OID637812424 
ProductPpiC-type peptidyl-prolyl cis-trans isomerase 
Protein accessionYP_413019 
Protein GI82703453 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0760] Parvulin-like peptidyl-prolyl isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00116871 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGATT TCGTCAACAA GAAAAGACGC ATCGTTCAGA TCATCCTGGG ACTTGCCACG 
TTACCGTTCG TTTTCTGGGG TGTGGAATCC TATCGCAATG CGGATGCAGG TGACCATGTT
GCGCTTGCCG CAGGAGAAAA AATTTCCCGC CAGGAATTTG AGCAGGCGCT GCGCAACCAG
CAGGAGAATA TGCGCGCCAC CCTGGGAGAG AATTTCAGCC CGGCGCTGCT GGAAAAGCCC
GAAGTGAGAG CGGCCATCCT GGAAGGACTG ATCCAGCAGC GCTTGCTGCG GCAGGAGGCA
TCTCGCGTTG GACTAGGGGT AGCCGACGCC CAGCTGATCG AGATGATCCA GAACATCGAT
TTATTTCAGG AAGACGGAAA TTTTTCGAAG CAGCGCTATG AAGAATGGCT GCGGAACCAG
GGAATGAACG CGCGTGCTTT TGAAGCACGC CTGCGTCAGG ATTTAATGCG ACAGCAACTG
GTTGATCCTT TCTCCAAAAA TGCGTTTATT TCCCATTCAG TGGCGGAAAG AATCCTGCGC
CTGGGTGACG AAAAGCGTGA AGTGAGGGTT GTTCAAATCC AACCGGAGCA GTTTCTGGCG
AATGTCAAAC CGGGTAATGA CGCCATCAGA GCCTACTATG AGGCGCATAA GGCTGAATTC
CAGGTGCCGG AGCAGGCGCG GGTCGAGTAT GTAGTGTTAT CGATGGATGC GCTGGCCGAG
AAGGCGCAGG TAACTGCGGA TGAAGCGCTG GCCTATTACG AGGAGCACAA GCACGAATTC
GGACAACCCG AAGAACGCCG CGCGAGTCAT ATCCTCATAA GCGCCCCTGC CTCCGCTTCC
GATCGCGCGA CAGCGCGCGC CAAAGCTGAG GAACTGCTGG CCGAAGTCAG AAAATCCCCG
CAGCGTTTTA CGGAATTGGC GAAACAGCAT TCCCAGGATC CGGGCTCGGC GCCGACCGGG
GGAGATCTCG GTTTCTTCGC GCGCAACATG ATGACGAAGT CATTTGAAGA CGCGGTTTTC
CGGATGAAGC CGGGTGAAAT CAGCGATATC GTGGAGACTG AGCATGGATT TCATATCATC
CTGCTCGCGG AGGCAAGAGG CGGAAAGCAG GCGAGTCTCG AGGAAGTGAA GAAGCAGGTT
GAACAGGAAG TCAGGAAACA GAAAGCTGCA AAGACGTTTG GTGAAATGGC GGACAGCTTC
AGTAACATGG TGTATGAGCA GAGCGATAGC CTGAAACCGG TCGCGGAAAG TCTTGGCTTG
ACCATACAGG AGAGCGGCTG GATTCGGATG AACTCGGATG AGCCACCTTA TCTCAACAAT
GCACGGTTGC TTCAGGCAAT ATTTTCGGAA GACGCGATCA AGGATAAACG CAACACGGAG
GCGATCGAAG TTTCTTCCAA CACGCTTGTT TCGGCCAGAG TGGTGGACTA CAAACCGGCC
GCTACTCCTT CTGTGGATGA ATTGAGGGAT AAAATCGCGG CGTTGGTCGC CCGGGAGGAG
GCTTCCAGAG CGGCGATAAA TGAAGGGAAG GAGCAGCTTG CGCAACTGCA GCAGGGAAAG
ACTGCCCCAA TCAAATGGAC TCCCGGCCAG CAGGTTTCCA GAAGGGAGCG CCAGGGTTTT
GACAATGAGA CGGTGCAGGC GATTTTCAGG GCTGAAACAA ATCATCTCCC GGCCTATTCA
GGCCTGCCGA ATGCCCAAGG TGGTTTTACC CTGATTCGCG TTGACCGGGT CATCGAGTCA
CGGCCGCCCA GCGCAGAGGA GCGAAAAACT TTTGCAGGTC AACTGAGCCA GTTGTTCGCC
CAGGAAGAGT TTTCATCCTA TCTGGATGGA ATCAAGAAAA GGTATGATGT ATCGGTCAGG
AGTGAAAGCC TCGAGAAATA G
 
Protein sequence
MFDFVNKKRR IVQIILGLAT LPFVFWGVES YRNADAGDHV ALAAGEKISR QEFEQALRNQ 
QENMRATLGE NFSPALLEKP EVRAAILEGL IQQRLLRQEA SRVGLGVADA QLIEMIQNID
LFQEDGNFSK QRYEEWLRNQ GMNARAFEAR LRQDLMRQQL VDPFSKNAFI SHSVAERILR
LGDEKREVRV VQIQPEQFLA NVKPGNDAIR AYYEAHKAEF QVPEQARVEY VVLSMDALAE
KAQVTADEAL AYYEEHKHEF GQPEERRASH ILISAPASAS DRATARAKAE ELLAEVRKSP
QRFTELAKQH SQDPGSAPTG GDLGFFARNM MTKSFEDAVF RMKPGEISDI VETEHGFHII
LLAEARGGKQ ASLEEVKKQV EQEVRKQKAA KTFGEMADSF SNMVYEQSDS LKPVAESLGL
TIQESGWIRM NSDEPPYLNN ARLLQAIFSE DAIKDKRNTE AIEVSSNTLV SARVVDYKPA
ATPSVDELRD KIAALVAREE ASRAAINEGK EQLAQLQQGK TAPIKWTPGQ QVSRRERQGF
DNETVQAIFR AETNHLPAYS GLPNAQGGFT LIRVDRVIES RPPSAEERKT FAGQLSQLFA
QEEFSSYLDG IKKRYDVSVR SESLEK