Gene Nmul_A2572 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2572 
Symbol 
ID3784652 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2945553 
End bp2947646 
Gene Length2094 bp 
Protein Length697 aa 
Translation table11 
GC content60% 
IMG OID637812663 
Producthypothetical protein 
Protein accessionYP_413253 
Protein GI82703687 
COG category[R] General function prediction only 
COG ID[COG3211] Predicted phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.682256 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGCA CCCTCCACGC TGATAGAAAG AACAATCCCG ATCGCACCAG CAGCAACGAA 
TCCGGCAATG CATCCCTGCA TGATCTGATC GAAGCCGGCA GGCTTTCCCG CCGCGCCTTC
CTCCAACGTT CGCTAGGCCT TACGGCTATG GCATTCGGCG GCTCCCTGCT CGATGGCCTG
ATGCAGTTTG CTTACGCAGC GCCGGCGCCT GTGAACGGCA TCGGTTTCGA TTCAGTGCCT
GCCAACCTAT ATCTCTCACC CGGGGATGAC GCCGTTACCG TTCCCGCCGG CTATACCGCG
CGCGTACTGG TCGCCTGGGG AGACTCTCTT ACCCAGGCGC CGCACTGGAA TCCGGGGAGC
GCAATGACCG AGACGGTACA GTTGCACGCC TTCGGCGCCC ATGTGGACGG GATGCATCTG
TTTCCCTTCC CTCCCATGGG ATCGTCCGGA TCACCCGGCG TCGCCAATAT CCGCGGTCTG
CTGGTGACGA ACCATGAGTA CGTCGATCCT CCTCTGGTCA ATAACATCAC TCCCGCATCC
AGCTATGCGA CCACCCCCAT CACTCTCGAT ATGGTGCGCG CACAGCAGGC CGCGCATGGC
ATCAGCGTGG TCGAGGTGTG GAAAAAGAAC GGCATGTGGG AGGTCCAGCG TACTTCCGCC
TTCAACCGCC GCATTACCGG CAACTCCCTC TGCAAGCTGA GCGGACCCGC CGCCGGGCAT
GATCTGATGA AAACGGCCGC CGACCCCGGC GGCATGAATG TTCTCGGCAC ACTCAACAAC
TGCTCCAACG GCCATACCCC TTGGGGCACT TATCTCACCT GCGAGGAAAA CTGGAACGGC
TATTTCTCCA ATGAGACCGG AGATGTGGCA GGCGCAAACG ATCCGGAACA GAAGCGCCGG
ATTCTAAACG GGCAAGCGCG CTATGGTATC GGCAAGGGCG GGTTCGGTTA CCGCTGGCAC
GAAATGGATG CGCGTTTCCG TGCCGACCTT AATCCGAATG AGTCGCATCG CTTCGGCTGG
GTAGTGGAAA TCGATCCATG GGATCCAAAG AGCACGCCGG TGAAGCGCAC TGCGCTGGGA
CGTTTCAAGC ACGAAAATGC CAGTTGCGTC GTGGACCCAG ATAATACGGT CGTCATATAT
ATGGGTGATG ATGAGCGCAA CGAGTACCTT TACAAGTTTG TCTGCGCCAA TAAATATAAC
CCCCGCAACC GCGCGGCCAA CCGCGATCTG CTGGATTCCG GCACGCTTTT CGTGGCAAGG
TTCAATGCGG ATGGCGGCGG GAAGTGGCTG CCACTGGTCT GGAACCAGAA TGGACTGACG
CCGGCAAATG GCTTTGCCGA TCAAGCCGAA GTATTGATCA AGGCGCGCCA AGCAGCGGAT
CGCGCAGGCG CCACGATGAT GGATCGTCCG GAATGGATAG CAGCGCACCC CGCCTCGCGT
GAAGTCTATA TGACGCTTAC CAACAACAAC CGCCGTGGAA GCAATCCGCC GTCCGGCAAC
AGCATCGACG GAAGTACCCC TGCCGGCAGT GCGCGTCCTC CCGTCGATGC CGCGAATCCC
AGGCTGGACA ACCGTTACGG TCACATCATC CGGTGGCGCG AAAACATGGG CAAGGCCGAC
GCCCTGGATT TCGAATGGGA CATTTTCGTT GAATGCGGTG ACAAGCTGGA CCCGCAGCCG
CATCATCGCG GCAATATTAA CGGCGACGAT TACGGCGCCC CCGATGGCCT CTGGTTCGAT
CAGGATGGCC GCCTGTGGAT ACAGACCGAT CAGGCAGGTG ATGCCACTGG CGACTGGGCC
AATATCGGGG GAAATGTCAT GCTGTGCGCG AATCCTTCCA CGGGCGAGAC GCGCAGATTC
CTCACTGCGC CGAAGTACTG CGAGGTCACC GGAGTTACAA GTTCTCCGGA CGGCAAGGCC
CTGTTCGTCG GCATCCAGCA CCCGGGGGAG GATTGGGAAA CGCATTTTAC CCAGAACTCG
ACCTGGCCCG ACAGCGGTCA AAATGGCCCC ACTACGGCGG GCGGTTCCCC ATCCAAGCCC
CGCTCCGCCG TGGTGGTCAT CACTAAAGAT GATGGCGGCG TGATTGGCAC CTGA
 
Protein sequence
MSSTLHADRK NNPDRTSSNE SGNASLHDLI EAGRLSRRAF LQRSLGLTAM AFGGSLLDGL 
MQFAYAAPAP VNGIGFDSVP ANLYLSPGDD AVTVPAGYTA RVLVAWGDSL TQAPHWNPGS
AMTETVQLHA FGAHVDGMHL FPFPPMGSSG SPGVANIRGL LVTNHEYVDP PLVNNITPAS
SYATTPITLD MVRAQQAAHG ISVVEVWKKN GMWEVQRTSA FNRRITGNSL CKLSGPAAGH
DLMKTAADPG GMNVLGTLNN CSNGHTPWGT YLTCEENWNG YFSNETGDVA GANDPEQKRR
ILNGQARYGI GKGGFGYRWH EMDARFRADL NPNESHRFGW VVEIDPWDPK STPVKRTALG
RFKHENASCV VDPDNTVVIY MGDDERNEYL YKFVCANKYN PRNRAANRDL LDSGTLFVAR
FNADGGGKWL PLVWNQNGLT PANGFADQAE VLIKARQAAD RAGATMMDRP EWIAAHPASR
EVYMTLTNNN RRGSNPPSGN SIDGSTPAGS ARPPVDAANP RLDNRYGHII RWRENMGKAD
ALDFEWDIFV ECGDKLDPQP HHRGNINGDD YGAPDGLWFD QDGRLWIQTD QAGDATGDWA
NIGGNVMLCA NPSTGETRRF LTAPKYCEVT GVTSSPDGKA LFVGIQHPGE DWETHFTQNS
TWPDSGQNGP TTAGGSPSKP RSAVVVITKD DGGVIGT