Gene Nmul_A0699 details


Gene Information

Locus tag: Nmul_A0699
Symbol:
ID: 3786161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 805874
End bp: 807133
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 53%
IMG OID: 637810781
Product: SufS subfamily cysteine desulfurase
Protein accession: YP_411398
Protein GI: 82701832
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
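
The length fields in this record follow from the coordinates by simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(805874, 807133)
print(length, protein_length_aa(length))  # prints: 1260 419
```

Under those assumptions the computed values agree with the Gene Length (1260 bp) and Protein Length (419 aa) fields.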


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATCA TAGATTCCAC GCTGCAACCT CAGATTGATT CGACGCTCGA CGTCGCACGT 
GTGCGTGCCG ATTTTCCGAT TCTGGAGCTT CAGGTCGAAG GAAAGCCGCT GGTCTACCTC
GATAATGCAG CCTCCAGCCA GATGCCGCAG CCCGTGATCG ATCGCCTGGT GCGTTACCAG
ACAAGGCAGC ACGCAAATAT CAATCGGGGT GTGCATTATT TATCCGAAAC TGCCACGGCA
GAATATGAGG CGGCGCGCTG TAAATTGCAG CGCTTCATCA ATGCGCGTGA AGACAGGGAG
GTCATCTTTA CCAGCGGGAC GACAGATTCC ATCAACCTTG TGATGCACGG GTATGGACGC
AAATTCATCG GCGCGGGTGA TGAAATCATC CTGACTACCT TGGAGCATCA TTCGAATATC
GTTCCATGGC AGATGCTGGC GGAAGAAAAA GGGGCAAGGA TACGTGTCGT GCCGATCAAC
GACGCCGGTG AACTTCTCAT CGATGATTAT GAGAAGCTTT TCAACGAGCG TACGAAGTTC
GTGGGTGTGA TGCACGTATC GAATGCTTTG GGGACGATCA ACCCGGTCAG GGAAATGATC
GCCTTCGCGC ATGCCCGCGG TGTTCCGGTG CTGGTGGATG GCGCGCAGGC TGCGCCGCAT
ATGAAAGTGG ATGTCCAGGA GCTCGACTGC GATTTCTATG CATTTTCGGG ACACAAAATG
TGTGGCCCGA CAGGCATTGG TATTCTGTAC GGAAAAGCGG AACTCCTGGA GCGCATGCAG
CCGTTCAAAG GCGGCGGCGA CATGATTCTA TCGGTGACAT TCGAAAGAAC CACCTACAAT
TCCATTCCAC ACAAATTCGA AGCAGGAACT CCGCCCATTG CAGCCGCTAT CGGGCTTGGC
GCTGCCGTCG ATTATTTATC GAATGTCGGC ATCAATGCCA TCGCCGCGTA CGAGATTGAG
CTGCTCAATT ACGCAACCGA ACAGATACTC CAGATACCCG GAGTGCGCAT CATAGGCACA
GCAGCGAAAA AAACCGCGGT GCTGTCTTTC GAGGTGGCGG GTGTGCATCC GCACGATGTC
GGTACGCTTT TGAATCAGGA AGGCGTTGCC GTTCGTACCG GCCATCACTG CGTGCAACCC
GTGATGTTGC GCCTCAAGGT GCCTGCCACT ACACGCGCTT CATTTGCGTT TTATAATACA
ATGGCTGAGG TAGATACTTT CATCGCTGGC ATTCGTACTG TGCAAAAGAT ATTTATTTAA
 
Protein sequence
MKIIDSTLQP QIDSTLDVAR VRADFPILEL QVEGKPLVYL DNAASSQMPQ PVIDRLVRYQ 
TRQHANINRG VHYLSETATA EYEAARCKLQ RFINAREDRE VIFTSGTTDS INLVMHGYGR
KFIGAGDEII LTTLEHHSNI VPWQMLAEEK GARIRVVPIN DAGELLIDDY EKLFNERTKF
VGVMHVSNAL GTINPVREMI AFAHARGVPV LVDGAQAAPH MKVDVQELDC DFYAFSGHKM
CGPTGIGILY GKAELLERMQ PFKGGGDMIL SVTFERTTYN SIPHKFEAGT PPIAAAIGLG
AAVDYLSNVG INAIAAYEIE LLNYATEQIL QIPGVRIIGT AAKKTAVLSF EVAGVHPHDV
GTLLNQEGVA VRTGHHCVQP VMLRLKVPAT TRASFAFYNT MAEVDTFIAG IRTVQKIFI
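
The gene and protein sequences can be cross-checked against each other and against the GC content field. The following is a minimal sketch, not the database's own method. It assumes the sequence is given 5' to 3' on the coding strand in the space-separated 10-base blocks used here, and that the amino acid assignments of translation table 11 apply. Table 11 assigns the same amino acids as the standard code and differs only in which codons may act as start codons. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order; the amino acid string follows the
# NCBI layout for translation table 11 ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_block = "ATGAAAATCA TAGATTCCAC GCTGCAACCT"
print(translate(first_block))  # prints: MKIIDSTLQP
```

The translated fragment matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.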