Gene Nmul_A0956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0956 
Symbol 
ID3785747 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1108095 
End bp1109777 
Gene Length1683 bp 
Protein Length560 aa 
Translation table11 
GC content58% 
IMG OID637811039 
Producthypothetical protein 
Protein accessionYP_411651 
Protein GI82702085 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGT CTCCCGATAA AAGCGCGGTG GCCGCAACCG CGCGGCCAGC CATGGGTTTC 
CTCGCTACTG CCGGCAGCTG GCTGGACGAG AATATTTTCG CGCTAGGCCG TGAAATGCGG
TTGTCCTATC TGCCTCCACT CATGGTGTAC GTAGCTGCCG GCATTTCCGG GCTGACGGCA
ATCGTCGGCA CGTTTTACGT CAAGGAGCGG TTAGGGCTTT CGGCCGAATT TCTTGCGGCG
CTCGGGTTCT GGATGATGCT GCCATGGGCG TTGAAGATGC CGCTCGGGCA CCTCGTCGAC
CTCGTGTGGC GGTGGAAGAG CCTGCTGGTG TATCTGGGCG CCGGCTTGAT CACATTGAGC
CTGGTCATCA TGATCGGGCT GCTCGGACAC CTGGAGGAAA TGCGCGCCAT TGCTACTGTG
GAGGCGTGGT ACGTTACTGC GGCATTGCTC GCGCCTATTG GCTACGTCCT TCAGGATGTG
GTGGCGGATG CGATGACGGT TGAAGCCGTG CCTCGAGTCG ACAAGGACGG AAACCCGCTG
ACGCTTGAAA GCCGCAAATC CATGCATGTC ACGATGCAGA CGCTGGGGCG GGTCGCCATC
ATCAGCGGCG GAATTCTGGT CTCGATGGTG AATGTTTATG TGTTGCAAGG CGTGGGTGAG
TTACCCGAGG CGGGCAAGGC CGCGGCGTAC CTGTTCGTCT ATGAGCTTGC CTTGATCATC
CCGCTGGTTT CCGTTACCGG TGTTCTGTTC GCCTCATGGT TGAGGCGGCG AGATATTAAG
CATCTTGTGG CGCAGGGACA TAGCCGGATG GAATCCGAAG CGCTGCTTGG GGTTAACCCC
GATCCTCCTC CGGTGAACTG GTGGATACTT GGCGGCGGAT TGGCTTTCAC GGCAGTCTCC
CTGAGTGTAG GCCTGAGCCA GATACCCGGG GGAGAGGAAA TCATCTTCCT GGTTTCCATG
GCCATCGTCT TATTCCTGAT GTGGCGGCTG ACTGGCGAAC TGGAACCCGA CGCGCGCAAT
GTGCTGGTGG GCACGGCAAT CCTGATATTC GTGTTCCGCG CCATACCCGG GCCGGGGGCT
GGTTCGACCT GGTGGATGAT CGATCACCTT GGCTTCGATC AGCAGTTTCT GGCGACATTA
TCCCTGATCG GCGCGACCTT GACCCTGGCG GGAATGTTCA TCTTTCGGCG ATTCATGGCT
GAACGTTCGA TTGCCTATAT TATCGGTTGG CTTACCATCG TCGGCACCTT CTTGTCCCTC
CCCATAATCG GGATGTATTA CGGTTTGCAC GAATGGACGG CTGCCTTAAC GAACGGCATG
GTGGATGCGC GCTTTATCGC AGTGATCGAT ACGGCGCTGG AATCCCCCCT GGGTCAGATT
GCAATGATCC CGATGCTTGC GTGGATTGCC AACTCCGCAC CGGAAGCGCT CAAGGCCACC
TTCTTCGCCG TAATGGCCTC GTTCACCAAC CTTGCCCTGT CTGCGTCGCA GCTCGGGACA
AAGTACATGA ACCAGATTTT CAGGGTAACA CGCGAAGTGA CGGACCCCGA TACCGGGAAG
ATTACTGTTC CTGCCGACTA CAGCGAACTT GGTCCACTGC TGGTTTCAGT GACCGTGATT
GGCCTCGTAT TGCCGCTGCT GGCTATCTTC CTGCTCAGGT ATTCACGCTT CCGCAATGCG
TGA
 
Protein sequence
MSQSPDKSAV AATARPAMGF LATAGSWLDE NIFALGREMR LSYLPPLMVY VAAGISGLTA 
IVGTFYVKER LGLSAEFLAA LGFWMMLPWA LKMPLGHLVD LVWRWKSLLV YLGAGLITLS
LVIMIGLLGH LEEMRAIATV EAWYVTAALL APIGYVLQDV VADAMTVEAV PRVDKDGNPL
TLESRKSMHV TMQTLGRVAI ISGGILVSMV NVYVLQGVGE LPEAGKAAAY LFVYELALII
PLVSVTGVLF ASWLRRRDIK HLVAQGHSRM ESEALLGVNP DPPPVNWWIL GGGLAFTAVS
LSVGLSQIPG GEEIIFLVSM AIVLFLMWRL TGELEPDARN VLVGTAILIF VFRAIPGPGA
GSTWWMIDHL GFDQQFLATL SLIGATLTLA GMFIFRRFMA ERSIAYIIGW LTIVGTFLSL
PIIGMYYGLH EWTAALTNGM VDARFIAVID TALESPLGQI AMIPMLAWIA NSAPEALKAT
FFAVMASFTN LALSASQLGT KYMNQIFRVT REVTDPDTGK ITVPADYSEL GPLLVSVTVI
GLVLPLLAIF LLRYSRFRNA