Gene Nmul_A1039 details


Gene Information

Locus tag: Nmul_A1039
Symbol:
ID: 3785166
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1200673
End bp: 1202043
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 59%
IMG OID: 637811123
Product: hypothetical protein
Protein accession: YP_411734
Protein GI: 82702168
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1030] Membrane-bound serine protease (ClpP class)
TIGRFAM ID:
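
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1200673, 1202043)
print(length)                  # 1371
print(protein_length(length))  # 456
```

Under these assumptions the computed values agree with the recorded gene length (1371 bp) and protein length (456 aa).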


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.649912
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTAC TCATTCGCAC TCTTTTTGTC AGTGTCCTGG CGATGCTGTT TCTGCTGCCG 
TCAATTGTTC GAGCTTCCCC TGTCGTGGTG TTGAAAATCG ATGGCCCCAT TGCTCCGGCC
AGCGCCGATT TCATCCAGCG CGGACTGGAA CGCGCCGCGA ACGAGAACGC GCTGCTTGTG
GTGTTACAAC TGGACACGCC GGGAGGCCTC GACACTTCCA TGCGGCAGAT CATCCGCGGT
GTCCTGGCTT CACCCTTGCC CGTGGCGACT TTTGTGGCGC CGAGTGGGGC TCGGGCAGCC
AGTGCCGGCA CTTATATCCT GTATGCCAGC CATATCGCCG CGATGGCTCC GGGCACCAAC
CTGGGTGCCG CAACCCCAAT AGAAATGGGT GGCTTTCCCC GATCAGAGCC TGAACCCAGG
CCCCAACCGA AATCAGGCGA GAACCGCGCG CAAAATCCGG AAAAAGAAGG CCAGCTTCCC
GTAAAGGACG AAATGTCCCG CAAAATGATT CATGATGCTG CGGCGTACAT ACGCGGCCTG
GCGCAAATGC GCGGGCGTAA TGTGGAATGG GCGGAAAGAG CCGTCCGCGA AGCGGTCAGT
CTGTCGGCCT CCGAAGCGCT GCACCTCAAA GTTGTCGATT ATATCGCTAC CGACATCGCG
GATCTGCTGA AGCAGCTCAA TGGCAGGCAA GTGAACGTAC TGGGCCAGGA CCGTAAGCTC
GATACCACTT CCGCCACCAT GGAAGTCGTG GAACCCGACT GGCGCACGCG GCTGCTCGCC
ATCATCACCA ATCCAAGCGT CGCCTATGTC CTGATGCTTA TCGGCATTTA CGGACTGTTC
TTCGAGTTTG CCAACCCCGG TTTTGTGCTA CCCGGCGTGG CAGGCGCCAT CTGCCTGCTT
ATCGCCCTGT ATGCCTTCCA GTTACTGCCG GTGAGCTATG CCGGACTTGC CTTGATCCTG
CTCGGGATCG GGTTCATGGT GGCCGAAGTA TTCCTGCCCA GCTTCGGGGC CCTCGGTATC
GGCGGAATTA TCGCTTTCGT CGTGGGCTCC CTGATGCTGA TTGACAGTGA GGCGCCCGGT
TTCGGTATCC CCTGGACGCT CGTCGGCGGA GTAGCCTTCG CCAGTGCCCT GTTCCTGATT
TCAGTAATCG GCATGGCGCT CAAGACCCGC CGGAAACCGT TGCTGAGTGG TCAGGAGCAT
ATGGTCGGCT CAGTGGGAGA AATGCTGGAA GACACTTCCG GTGACGGCAT GGCGCGTATC
CGCGGCGAGT TGTGGACCGT GCACTCCGCC CAACCGCTGG TTCGGGGCCA GAAGGTGCGT
GTGACCGGAA TCGACGGCCT CATCCTGCAT GTAACCGCAG CAGAAAAATA A
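
The gene sequence is shown in blocks of 10 bases, so the whitespace has to be removed before any calculation on it. The sketch below is one way to do that in Python and to compute a GC fraction that could be compared with the GC content field. How the record's own GC value was computed is not stated, so the plain (G + C) / length formula is an assumption, and the helper names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence above, used only as a short demo input.
first_row = "ATGAAACTAC TCATTCGCAC TCTTTTTGTC AGTGTCCTGG CGATGCTGTT TCTGCTGCCG"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(round(gc_fraction(seq), 3))
```

Passing the full gene sequence text to `clean_sequence` and then to `gc_fraction` gives the value for the whole gene.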
 
Protein sequence
MKLLIRTLFV SVLAMLFLLP SIVRASPVVV LKIDGPIAPA SADFIQRGLE RAANENALLV 
VLQLDTPGGL DTSMRQIIRG VLASPLPVAT FVAPSGARAA SAGTYILYAS HIAAMAPGTN
LGAATPIEMG GFPRSEPEPR PQPKSGENRA QNPEKEGQLP VKDEMSRKMI HDAAAYIRGL
AQMRGRNVEW AERAVREAVS LSASEALHLK VVDYIATDIA DLLKQLNGRQ VNVLGQDRKL
DTTSATMEVV EPDWRTRLLA IITNPSVAYV LMLIGIYGLF FEFANPGFVL PGVAGAICLL
IALYAFQLLP VSYAGLALIL LGIGFMVAEV FLPSFGALGI GGIIAFVVGS LMLIDSEAPG
FGIPWTLVGG VAFASALFLI SVIGMALKTR RKPLLSGQEH MVGSVGEMLE DTSGDGMARI
RGELWTVHSA QPLVRGQKVR VTGIDGLILH VTAAEK