Gene Nmul_A0710 details


Gene Information

Locus tag: Nmul_A0710
Symbol:
ID: 3786056
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 820197
End bp: 821858
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 53%
IMG OID: 637810792
Product: sulphate transporter
Protein accession: YP_411409
Protein GI: 82701843
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID:
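
The coordinate and length fields above are mutually consistent, which can be checked with a minimal sketch. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; neither assumption is stated in the record itself. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(820197, 821858)
print(length)                     # 1662
print(protein_length_aa(length))  # 553
```

Both values agree with the Gene Length and Protein Length fields in this record.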


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0792495
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTCAG CTAATGCATC TACTGCTTCT ACCGCTAAAC TGACAGTACC GAAAGATGGT 
CTGGCGGGGA TGATGCAGAA TTTGAAATAC GATTTCACTG CGGGTTTTTT CGTATTCCTG
CTGGCGTTGC CTTTGAGCCT GGGTATTGCG AAGGCGGGAG ATTTCCCTCC TGCGATGGGT
GTTCTTACGG CCATGATCGG CGGGGTGCTG GTCGCCCTCT TGGCCGGTTC GCCGCTTACC
ATCAAAGGGC CTGCGGCGGG TTTGATTACA ATCGTCGCAG GGTGTGTTGC CGAAGTCGGC
GGCGGCCCGG AAGGGTGGAA GATGGCCTTG GGCGTCATGG TAGTAGCGGG TGCGCTGCAG
GTCGTCATCG GCTTTATGAG AGCAGGTTCA CTTAGCGATT TCTTTCCGTT ATCCGCGGTA
CATGGAATGC TGGCTGCGAT CGGGCTTATC ATTATTTCGA AGCAGGCGCA CATCCTTCTC
GGAATAGACC CGGCGACACT GGCGGGAATG GGACCCGTGA AGCTGTATGC GCAACTCCCT
AATTCCATCA TGAATCCGAA CGTGCCTGTA GCGATAGTCG GGATACTTAG TCTGATTGTC
CTATTTGGGT TGCCGAAGAT AAAAAGTCCA CTAGTGAAGA AAATTCCAGC TCCAATGGTG
GTATTGCTGA TAGCTATCCC CGCTGCCATC GCACTGGATT TCAAGGGGAC GCAACCTGGA
CACATACTCG TTCATATTGG CGATTTCTGG AAGGAAATTA CTTTCAATGC TGACTTCTCA
CAGATTGGAA CGGGTGCATT CTGGAAATAT GTGATCATGT TCCTGCTGGT AGGGAGCCTG
GAGTCCTGCC TTACCGTCAA GGCGATTGAC AGTCTCGATC CCTGGAGGCG TCAGTCGAAT
TTCAACAAGG ACTTGATCGC TGTTGGCACG GGTAATACGC TTGCTGCGGT ACTCGGTGGA
TCGCCCATGA TATCGGAAGT CGCACGCAGC TCGGCAAATG TAGGCTTCGG TGCACGCACC
CGCTGGGCCA ATTTCTTCCA CGGCTTCTGC CTCTTTATAT CGATGCTGCT CTTGATCCCT
GTGATCGAGA TGATTCCGAA TGCCGCCCTG GCAGCCATGC TGATTTTCGT CGGCTATCGC
CTGGCCTCGC CGCACGAGTT TTTCAAGACC TACAAGATCG GCAGCGAGCA GTTGACGATC
TTCCTGGTGA CGATTGCGGT GACAGTCGCC AGCGACCTGT TGATGGGTAT CGGAGCAGGT
ATTCTCGTCA AATTCGTCTT CCATATCGTA AATGGCGCAT CGCTCGGCAA CCTTTTCTCG
GCGCGCTATC AATTGAAGCA GACGGGAAAT CAGTACTATA TGAACGTTCA GGATGCCGCA
ATATTTTCCA ACCTGATCGG CTTCAAGAAA GTACTGGCCA GGTTTGAACC CAAAAAGGAA
GTCGTTCTTG ACTTCAGCCA GGCAACCCTG GTCGATCATA CCTTCATGGA ATTCCTGGAG
CATTTCGAAG AGCAATATGT CGAAAACGGC GGTACCGTGA CGGTGACTGG CTTCGATCGC
TTCCAACCTT TCTCAGCCCA TCCGCTCGCG GGAAGAAAGG CGAAGAAGGA AAAAGTCAGC
GCTACCGCGC CCTCGCTTGA TGGTAGCACG TCGAGGGAGT GA
 
Protein sequence
MSSANASTAS TAKLTVPKDG LAGMMQNLKY DFTAGFFVFL LALPLSLGIA KAGDFPPAMG 
VLTAMIGGVL VALLAGSPLT IKGPAAGLIT IVAGCVAEVG GGPEGWKMAL GVMVVAGALQ
VVIGFMRAGS LSDFFPLSAV HGMLAAIGLI IISKQAHILL GIDPATLAGM GPVKLYAQLP
NSIMNPNVPV AIVGILSLIV LFGLPKIKSP LVKKIPAPMV VLLIAIPAAI ALDFKGTQPG
HILVHIGDFW KEITFNADFS QIGTGAFWKY VIMFLLVGSL ESCLTVKAID SLDPWRRQSN
FNKDLIAVGT GNTLAAVLGG SPMISEVARS SANVGFGART RWANFFHGFC LFISMLLLIP
VIEMIPNAAL AAMLIFVGYR LASPHEFFKT YKIGSEQLTI FLVTIAVTVA SDLLMGIGAG
ILVKFVFHIV NGASLGNLFS ARYQLKQTGN QYYMNVQDAA IFSNLIGFKK VLARFEPKKE
VVLDFSQATL VDHTFMEFLE HFEEQYVENG GTVTVTGFDR FQPFSAHPLA GRKAKKEKVS
ATAPSLDGST SRE
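
The relationship between the gene and protein sequences can be illustrated with a short sketch that translates a nucleotide string and computes GC content. It is not the database's own tooling. The codon-to-amino-acid assignments used here are those of the standard genetic code, which translation table 11 shares for sense and stop codons; alternative start codons are not handled. The helper names `translate` and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (standard code;
# table 11 uses the same assignments for sense and stop codons).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def _clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = _clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = _clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGTTCAG CTAATGCATC TACTGCTTCT"))  # MSSANASTAS
```

The output matches the first ten residues of the protein sequence. Pasting the full gene sequence into `translate` and `gc_content` would allow the same comparison against the complete protein and the GC content field.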