Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3306 |
Symbol | |
ID | 5703660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3816455 |
End bp | 3817759 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641272733 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_001538100 |
Protein GI | 159038847 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000348872 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCACCA TCGCGATTCC ACCGGGCATG CCGCAGTACG GCGACGTGCC GCGCTACGAC GTGGCGGCGG TGCGCGCCGA CTTCCCGATC CTGGACCGGA CGGTTAACGG GCACCCGCTG GTCTACCTGG ACAGCGCGAA CACGTCGCAC AAGCCACGGC AGGTGCTCGA CGTACTGCGG GAGCACTACG AGCGGCACAA CGCCAACGTG TCGCGTTCGG TTCACACGCT GGGCACCGAG GCCACCGAGG CGTACGAGGG GGCGCGGGCC AAGGTCGCCG CCTTCATCAA CGCTCCGAAC CCGGACGAGG TGGTGTTCAC CAAGAACTCC ACCGAGGCGA TCAACATCGT GGCGTACGCC TTCTCGAACG CGTCGCTGCG CCCCGACGCC GACCCGCGGT TCCGGTTGGG CCCCGGAGAC GAGGTGGTGA TCTCCGAGAT GGAGCACCAC TCGAACATCG TCCCGTGGCA GCTGATCTGC GAGCGTACCG GCGCCACGTT GCGCTGGTTC CCGGTCACCG ACCACGGTCG ACTCGACGAG TCGGGTCTGG CGGACCTGGT CACCGAGCGG ACGAAGATCG TCTCACTGGT GCACATGTCC AACATCCTCG GCACGGTCAA CGCCACGTCC CGGATCACCC AGCGGGTCCG TGAGGTCGGC GCACTGCTGC TGCTCGACTG TTCGCAGTCG GTGCCGCACA TGCCGATGGA CGTGGTCGAC TACGACGCGG ACTTCATCGT CTTCACCGGG CACAAGATGT GTGGCCCGAC CGGTATCGGA GTGCTCTGGG GCCGGTCCGA GCTGCTCGCG GCGATGCCGC CGGTGCTCGG CGGCGGGTCG ATGATCGAGA CGGTGGCGAT GTCGGGGTCG ACCTTCGCCG CGCCGCCGGC CCGGTTCGAG GCGGGCACCC CACCGATCGC CGAGGCGGTC GCGCTGGGCG CGGCGGTGGA CTACCTGTCC GGCGTCGGCA TGCGGGCCAT CCAGTGGCAC GAGAAGCATC TCACGGCGTA CGCCCTGGAC GCTCTGGCGA CGGTGCCCGG GTTACGGGTC TTCGGGCCGA CCGTGCCGGT GGGTCGGGGT GGCACGATCT CGTTCGCGCT GGGCGACATC CACCCGCACG ACGTTGGGCA GGTGCTCGAC TCGCTGGGTG TGCAGGTGCG GGTCGGTCAC CACTGTGCCC GTCCGGTCTG CACCCGGTTC GGCGTGCCCG CGATGACCCG GGCCTCGTTC TACCTCTACA CCACCACGGA GGAGATCGAC GCCTTGGTGG CGGGTCTGGA GCGGGTGCGG AAGGTGTTCG ACTGA
|
Protein sequence | MTTIAIPPGM PQYGDVPRYD VAAVRADFPI LDRTVNGHPL VYLDSANTSH KPRQVLDVLR EHYERHNANV SRSVHTLGTE ATEAYEGARA KVAAFINAPN PDEVVFTKNS TEAINIVAYA FSNASLRPDA DPRFRLGPGD EVVISEMEHH SNIVPWQLIC ERTGATLRWF PVTDHGRLDE SGLADLVTER TKIVSLVHMS NILGTVNATS RITQRVREVG ALLLLDCSQS VPHMPMDVVD YDADFIVFTG HKMCGPTGIG VLWGRSELLA AMPPVLGGGS MIETVAMSGS TFAAPPARFE AGTPPIAEAV ALGAAVDYLS GVGMRAIQWH EKHLTAYALD ALATVPGLRV FGPTVPVGRG GTISFALGDI HPHDVGQVLD SLGVQVRVGH HCARPVCTRF GVPAMTRASF YLYTTTEEID ALVAGLERVR KVFD
|
| |