Gene Nmul_A2032 details


Gene Information

Locus tag: Nmul_A2032
Symbol:
ID: 3784582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2327271
End bp: 2328368
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 53%
IMG OID: 637812121
Product: thiosulphate-binding protein
Protein accession: YP_412719
Protein GI: 82703153
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1613] ABC-type sulfate transport system, periplasmic component
TIGRFAM ID: [TIGR00971] sulfate/thiosulfate-binding protein
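
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
start_bp, end_bp = 2327271, 2328368
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints "1098 365"
```

Both results match the Gene Length and Protein Length fields listed in the record.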


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACACCCG ATAGTTCCCG TTTTGCTGTA GGTATATTTA TAGTGTTGGT GAGTCTCGGC 
GGCGCCTACG CTACTCTGAA TAATTTTTCC GATCGTCCTG CCGGCTCCGC TGAGCGGTTA
CACGAGGAAT TGAACGTCGG TTTTGCCTCT CACTGGAGAG CTCGGACGGG CGTGAACATC
AAGGTCGATC AGGCCCGGAG CAGGTCGGGG AAGCCCGTAC ATATTACGCT CGATGGGCTT
GATATTCCCG CCCTTGCGCT GTCCTACGAT GTGGACAAGC TGCATGATAA GGAAAGATTT
ATTGCACCTG ACTTCCGTCA GCTTTTAGCG CAGGATTTTC GGACTGGCTC TTATCCTTCC
CCCTATACCT CGACCATCGT ATTCCTGGTA AGGAAAGGCA ATCCAAAGAA ACTCAAGGAC
TGGGGCGATC TGGTACGCTC GGATATAAAA GTAGTTACAC CCAATCCAAG GCATTCTGAA
AGCGGCCGCT GGAATTATCT TGCGGCGTGG GGATACGCCG TGAGGCGATC AGGCGGCAGC
GAACAAGCTG CGCGTGAATT TGTCAGTCAG TTGTTTGCCA ATGTCCAGAC AGTGGATTAC
GAGGGTAAAA AGCCGGGAAA CTTGGGTGCC GCCTTTGTCT TTCGCAACAT CGGCGATGTA
CTCCTCACCT GGGAAAATGA AGCGTACCTG ATCGTTCAGA ACAGTGGAGC CGATAAGTTT
GAAGTCATCA CCCCGTCCAT ATCAATAGTG GCTGAACCCG CTATAAGTGT GGTGGACGCA
GCTGCACGTG GGAAAAGCAC ACGCCGTGTA GCGGCTTCAT ACATCGAATA CTTATATACA
CCCCAGGCGC AGCATATCGC CGCCAAGCAC TATTACCGTC CCCGCGATCC GGCCATTACC
ACGAAGTACG TGGACAGGTT TCCGCGCCTT GAGTTGTTTA CAGTTGACGA GGTTTCCAGT
GGCTGGCAGA AAGCCCAGAA AATACATTTT GCCAGGGGCG GTGTTTTTGA CCAGATCACC
GGTGATGTTC CGAATTCTGT CGCCGTGAGG GGCGCTATCG ATAGGGACCA TATTCAAGCC
GGCAATGCTA AAGGCTGA
 
Protein sequence
MTPDSSRFAV GIFIVLVSLG GAYATLNNFS DRPAGSAERL HEELNVGFAS HWRARTGVNI 
KVDQARSRSG KPVHITLDGL DIPALALSYD VDKLHDKERF IAPDFRQLLA QDFRTGSYPS
PYTSTIVFLV RKGNPKKLKD WGDLVRSDIK VVTPNPRHSE SGRWNYLAAW GYAVRRSGGS
EQAAREFVSQ LFANVQTVDY EGKKPGNLGA AFVFRNIGDV LLTWENEAYL IVQNSGADKF
EVITPSISIV AEPAISVVDA AARGKSTRRV AASYIEYLYT PQAQHIAAKH YYRPRDPAIT
TKYVDRFPRL ELFTVDEVSS GWQKAQKIHF ARGGVFDQIT GDVPNSVAVR GAIDRDHIQA
GNAKG
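
The relationship between the gene sequence and the protein sequence above can be checked with a short script. The sketch below is an illustration only: it assumes the sequence blocks are pasted in as space-separated groups of ten, and it uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases in the order T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Table 11 also allows alternative start codons (for example GTG and TTG),
    which are read as M at the first position; that case is not handled here
    because this gene starts with ATG.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from the record above.
first_line = "ATGACACCCG ATAGTTCCCG TTTTGCTGTA GGTATATTTA TAGTGTTGGT GAGTCTCGGC"
print(translate(first_line))  # prints "MTPDSSRFAVGIFIVLVSLG"
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input gives a value to compare against the GC content field.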