Gene Nmar_0007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmar_0007 
Symbol 
ID5773972 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosopumilus maritimus SCM1 
KingdomArchaea 
Replicon accessionNC_010085 
Strand
Start bp5639 
End bp7168 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content33% 
IMG OID641315624 
Productpeptidylprolyl isomerase 
Protein accessionYP_001581345 
Protein GI161527519 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0652] Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGAAAA TAATTCTTGC AATCACATTT TCATTATTGT TTATGGGAAT TGTAAATCAA 
ACAACTGCCC AATCTGTAGA GGTTAATCCA ATTAAGATAA TGGATCCTGT TGTAATCATT
GAAACTAGTT TGGGAAATAT CACAATTGGC TTTTTCCCAA ATGATGCACC TAAACATGTA
GAAAATTTTC TTAAACTATC AACCTCTGGA TTTTACGATG GAACTCTTTT TCATAGAATA
ATTCCAGGCT TCATGATTCA GGGCGGTGAT CCAAATACAA TTGATGGCGA TTCAAGTACC
TGGGGGACTG GTGGTCCAGA TGAAAGATTA GATGCAGAGT TTAACAATAT CAAACATAAT
CGTGGAATAG TTTCAATGGC AAGATCAGCT GATCCAAATA GCGGTGGCTC ACAATTCTTT
ATCGTACATC AAAATTCCAA CTTTCTTGAT GAACAATACA CTGTATTTGG TAGAATTATA
ACTGAAGAAA GTTTTGAAAC ACTTGATAAA ATTGCATCAG TTTCCACTGG AAGTAGAGAT
GAACCTATAA ATCCTGAACA AGTAAGAATT ATCAAAGTTT CAGTTGTTCC ACGTGCAGAT
ATTCCTGATT TAATTGAATT AACAGAACCT GATCGAATCC AAACAAACGT AGAGCCATCT
ACTGGGAGTC AACTATTTGA AAGCGAAGAA CATGATATAG CATTTAGTGC TCCTGCAGGC
TGGCTTTTAC AACAACCTGA GAAAACTCAA GAAAACACTC CTGATGTTGT AGCAGTTGGA
CCAAAGGTTG GCACAGTAAA TCCTGTAATT TCACTTACAA TACTTCCTAC AAATGGAAAA
ATCATTGATG ATATAATTTC AGAAAAAAAT GAAGAATTAC AACCACTTGC AGAATCAAGA
GGATTGAATA TAATTTCTCA AGAACAAATT ACCATTAATG ATAAGGATGC ATATGTAACT
AACGCACAAG GAGTTTTTTC TGCTAATGGT CAGGATTATG ATGTAAAATT CAAAGAGGTT
ATAATTTATG GTTCAGATAA CTATTACACT TTTTCGTACA GTAACGGTGT AGACGACTTT
GATTCTCAAA TAGAAAGATT TAACGAAACA ATAGATTCTT TTAAAAAATT ATCTGAAGAC
TCTGCAAATT CTGAAGAAAA TGGTGGATGT TTAATTGCAA CTGCAACATT TGGCTCTGAA
CTTGCACCTC AAGTACAACA ATTAAGAGAA TTAAGAGATA ACACTATTCT TGAAACAGAA
TCTGGAACTG CTTTTATGAG TGGATTTAAT CAATTGTATT ATTCATTCTC CCCAACAATT
GCTGATTTAG AACGTGAGTC TCCCCTCTTC AAAGAAATTG TGAAACTAAC AATCACTCCG
ATGTTGTCTT CACTTTCAAT TCTAAAGTAT GCTGAAATAA ATTCAGAAGA AGAAATGATC
TCTTATGGTG TAGGAATAAT TCTAATGAAT ATTGGAATGT ATTTTGTAGC TCCAACTATT
ATCATCTATA AAATTAGAAA ACTAAACTAG
 
Protein sequence
MKKIILAITF SLLFMGIVNQ TTAQSVEVNP IKIMDPVVII ETSLGNITIG FFPNDAPKHV 
ENFLKLSTSG FYDGTLFHRI IPGFMIQGGD PNTIDGDSST WGTGGPDERL DAEFNNIKHN
RGIVSMARSA DPNSGGSQFF IVHQNSNFLD EQYTVFGRII TEESFETLDK IASVSTGSRD
EPINPEQVRI IKVSVVPRAD IPDLIELTEP DRIQTNVEPS TGSQLFESEE HDIAFSAPAG
WLLQQPEKTQ ENTPDVVAVG PKVGTVNPVI SLTILPTNGK IIDDIISEKN EELQPLAESR
GLNIISQEQI TINDKDAYVT NAQGVFSANG QDYDVKFKEV IIYGSDNYYT FSYSNGVDDF
DSQIERFNET IDSFKKLSED SANSEENGGC LIATATFGSE LAPQVQQLRE LRDNTILETE
SGTAFMSGFN QLYYSFSPTI ADLERESPLF KEIVKLTITP MLSSLSILKY AEINSEEEMI
SYGVGIILMN IGMYFVAPTI IIYKIRKLN