Gene Information |
Locus tag | Nmul_A0699 |
Symbol | |
ID | 3786161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | + |
Start bp | 805874 |
End bp | 807133 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 637810781 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_411398 |
Protein GI | 82701832 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
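The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for Nmul_A0699.
gene_length = span_length(805874, 807133)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints: 1260 419
```

Both results agree with the Gene Length (1260 bp) and Protein Length (419 aa) fields of the record.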
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATCA TAGATTCCAC GCTGCAACCT CAGATTGATT CGACGCTCGA CGTCGCACGT GTGCGTGCCG ATTTTCCGAT TCTGGAGCTT CAGGTCGAAG GAAAGCCGCT GGTCTACCTC GATAATGCAG CCTCCAGCCA GATGCCGCAG CCCGTGATCG ATCGCCTGGT GCGTTACCAG ACAAGGCAGC ACGCAAATAT CAATCGGGGT GTGCATTATT TATCCGAAAC TGCCACGGCA GAATATGAGG CGGCGCGCTG TAAATTGCAG CGCTTCATCA ATGCGCGTGA AGACAGGGAG GTCATCTTTA CCAGCGGGAC GACAGATTCC ATCAACCTTG TGATGCACGG GTATGGACGC AAATTCATCG GCGCGGGTGA TGAAATCATC CTGACTACCT TGGAGCATCA TTCGAATATC GTTCCATGGC AGATGCTGGC GGAAGAAAAA GGGGCAAGGA TACGTGTCGT GCCGATCAAC GACGCCGGTG AACTTCTCAT CGATGATTAT GAGAAGCTTT TCAACGAGCG TACGAAGTTC GTGGGTGTGA TGCACGTATC GAATGCTTTG GGGACGATCA ACCCGGTCAG GGAAATGATC GCCTTCGCGC ATGCCCGCGG TGTTCCGGTG CTGGTGGATG GCGCGCAGGC TGCGCCGCAT ATGAAAGTGG ATGTCCAGGA GCTCGACTGC GATTTCTATG CATTTTCGGG ACACAAAATG TGTGGCCCGA CAGGCATTGG TATTCTGTAC GGAAAAGCGG AACTCCTGGA GCGCATGCAG CCGTTCAAAG GCGGCGGCGA CATGATTCTA TCGGTGACAT TCGAAAGAAC CACCTACAAT TCCATTCCAC ACAAATTCGA AGCAGGAACT CCGCCCATTG CAGCCGCTAT CGGGCTTGGC GCTGCCGTCG ATTATTTATC GAATGTCGGC ATCAATGCCA TCGCCGCGTA CGAGATTGAG CTGCTCAATT ACGCAACCGA ACAGATACTC CAGATACCCG GAGTGCGCAT CATAGGCACA GCAGCGAAAA AAACCGCGGT GCTGTCTTTC GAGGTGGCGG GTGTGCATCC GCACGATGTC GGTACGCTTT TGAATCAGGA AGGCGTTGCC GTTCGTACCG GCCATCACTG CGTGCAACCC GTGATGTTGC GCCTCAAGGT GCCTGCCACT ACACGCGCTT CATTTGCGTT TTATAATACA ATGGCTGAGG TAGATACTTT CATCGCTGGC ATTCGTACTG TGCAAAAGAT ATTTATTTAA
|
Protein sequence | MKIIDSTLQP QIDSTLDVAR VRADFPILEL QVEGKPLVYL DNAASSQMPQ PVIDRLVRYQ TRQHANINRG VHYLSETATA EYEAARCKLQ RFINAREDRE VIFTSGTTDS INLVMHGYGR KFIGAGDEII LTTLEHHSNI VPWQMLAEEK GARIRVVPIN DAGELLIDDY EKLFNERTKF VGVMHVSNAL GTINPVREMI AFAHARGVPV LVDGAQAAPH MKVDVQELDC DFYAFSGHKM CGPTGIGILY GKAELLERMQ PFKGGGDMIL SVTFERTTYN SIPHKFEAGT PPIAAAIGLG AAVDYLSNVG INAIAAYEIE LLNYATEQIL QIPGVRIIGT AAKKTAVLSF EVAGVHPHDV GTLLNQEGVA VRTGHHCVQP VMLRLKVPAT TRASFAFYNT MAEVDTFIAG IRTVQKIFI
|
| |
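The GC content and protein sequence fields can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the method used to build this record. It assumes the sequence is pasted as whitespace-separated blocks as shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean_sequence(text: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Alternative start codons (allowed in table 11) are not special-cased here;
    this gene begins with ATG, so that does not matter for this record.
    """
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
fragment = clean_sequence("ATGAAAATCA TAGATTCCAC GCTGCAACCT")
print(translate(fragment))  # prints: MKIIDSTLQP
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow comparison with the reported GC content and the 419-residue protein.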