Gene Nmul_A1535 details


Gene Information

Locus tag: Nmul_A1535
Symbol:
ID: 3785608
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 1753640
End bp: 1754836
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 54%
IMG OID: 637811623
Product: hypothetical protein
Protein accession: YP_412230
Protein GI: 82702664
COG category: [S] Function unknown
COG ID: [COG3503] Predicted membrane protein
TIGRFAM ID:
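
The coordinates and lengths listed above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1753640
END_BP = 1754836
GENE_LENGTH_BP = 1197
PROTEIN_LENGTH_AA = 398


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1754836 - 1753640 + 1 gives 1197 bp, and 1197 / 3 - 1 gives 398 aa, matching the listed values.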


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000132608
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCAAAC CCTCTCCTGC ACCTTCCTCA ACTCCTCCAC CCTATATTAC TTCTGTCGAT 
CTCATGCGTG GGCTGGTCAT GGTTCTCATG GCCCTTGACC ATGTGCGCGG ATTCTTCACC
AATGCCGATT TCAGCTCCAC TGATCTTGCC CGCACTACTC CCGGCGTTTT CCTGACACGC
TGGATCACTC ACCTGTGCGC CCCCACCTTT GTGTTTCTGG CAGGCACCAG TGCTTATCTG
TCTGCATCGC GGGGGATGGC CCACTCCCAG TTGGCGAAAC GATTATTCCT GCGGGGGTTG
TGGCTGGTAT TTCTTGAACT GACGGTGGTG CGTTTTGCCT GGTTCTTCAA TCTGGATTAC
AACTTGATGG ACCTGCAGGT GATCTGGGCG CTGGGGTGGT CCATGATTGT TCTCGCGGCG
CTCGTCTATC TGCCGCTATG GGCGGTCGCC GGCGTTGGCA TCGGCATGAT CCTGACTCAC
AATCTGCTCG ACAGCATACG GCTTGAGGAT TTTCAGGCAG CGGGCGGCTC GCTCACCTGG
AAAGGGTGGC TGGTGAGCGT GCTCCACATC CCCCACTTTC CGGTGGTGTA CCCTCTCATC
CCCTGGATCG GAGTAATGGC CTCGGGTTAT GCATTTGGTC CTCTCATGCT GCTGCCATCG
AGGACAAGAA TAAAGGTAGT ATTTAAATGC GGCACGATTC TTGTCGCAGG CTTCCTCATT
CTCCGCGGCC TGAATATTTA TGGCGATCCG GATCCATGGG TTTTGCAGGA AACCCCCGTG
TTCACACTAC TTTCCTTCCT GAATACGACG AAATATCCAC CGTCACTGCT CTATCTGCTG
ATGACACTCG GACTCATGTT CCTGCTGATA TCTGCATTTG AATGGTGGCA TGAGGCGCAC
GGGCCGCACG GTGTCGCAGG ACGTTTTCTG ATTACCTTTG GCCGCGTGCC CCTGTTTTTC
TATCTCATTC ATCTGTATTT TATTCATGGC TTCACCCTGC TCATTGCGTT TGCAATGGGA
GCGAATATTC GCTCCTTCCT GACCTCTTCC TGGGAATTCC CATCCTGGTG GGGATTCAGC
CTTCCAGTTG TTTACCTGGT ATGGGTGGGA GTCACTACCA CACTCTATCC CATCTGCCGC
CGGTTCGCGG CATTAAAATC TCGCCACCGG GGCAGTTGGT GGACGCCCTA TATCTAA
 
Protein sequence
MAKPSPAPSS TPPPYITSVD LMRGLVMVLM ALDHVRGFFT NADFSSTDLA RTTPGVFLTR 
WITHLCAPTF VFLAGTSAYL SASRGMAHSQ LAKRLFLRGL WLVFLELTVV RFAWFFNLDY
NLMDLQVIWA LGWSMIVLAA LVYLPLWAVA GVGIGMILTH NLLDSIRLED FQAAGGSLTW
KGWLVSVLHI PHFPVVYPLI PWIGVMASGY AFGPLMLLPS RTRIKVVFKC GTILVAGFLI
LRGLNIYGDP DPWVLQETPV FTLLSFLNTT KYPPSLLYLL MTLGLMFLLI SAFEWWHEAH
GPHGVAGRFL ITFGRVPLFF YLIHLYFIHG FTLLIAFAMG ANIRSFLTSS WEFPSWWGFS
LPVVYLVWVG VTTTLYPICR RFAALKSRHR GSWWTPYI
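
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a minimal sketch, not the method used to generate this record: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon table is the standard one, which translation table 11 shares for all sense and stop codons (table 11 differs only in its permitted start codons). Only the first row of the gene sequence is included here; the full sequence can be pasted in the same way.

```python
# Standard codon assignments, built in NCBI "TCAG" order.
# Translation table 11 uses the same assignments; it differs in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon.

    The first codon is translated as written; an alternative start codon
    (for example GTG or TTG) would need to be mapped to M separately.
    """
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First row of the gene sequence, copied from this record.
    first_row = clean(
        "ATGGCCAAAC CCTCTCCTGC ACCTTCCTCA ACTCCTCCAC CCTATATTAC TTCTGTCGAT"
    )
    print(translate(first_row))  # MAKPSPAPSSTPPPYITSVD
    print(round(gc_content(first_row), 2))
```

The 20 residues produced from the first row match the start of the protein sequence listed above (MAKPSPAPSS TPPPYITSVD). Applying `gc_content` to the full gene sequence gives the value to compare against the listed GC content.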