Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4732 |
Symbol | |
ID | 5704557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5354057 |
End bp | 5354932 |
Gene Length | 876 bp |
Protein Length | 291 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641274130 |
Product | dihydropteroate synthase |
Protein accession | YP_001539476 |
Protein GI | 159040223 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0294] Dihydropteroate synthase and related enzymes |
TIGRFAM ID | [TIGR01496] dihydropteroate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0090044 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000281366 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGACCGATC TGGTGCAGGC CGCGTCCCCG GTGGTGATGG GCATCCTGAA CGTCACGCCG GACTCCTTCT CCGACGGCGG ACGCTACGCC GATCTCGACG CGGCCATCGC ACACGGGGTG CGGCTGCGTG ACGACGGCGC CCACCTGGTG GACGTGGGCG GCGAGTCGAC CCGTCCCGGG GCCGACCGGG TGGACGCCGA GACCGAGGCC ACCCGGGTGC TGCCGGTGGT GCGCGAGCTG ACGGCCGCCG GGGTGCCGGT CAGCATCGAC ACCACCCGGG CCCGGGTGGC CGAGCTGGCG CTCGCCGCCG GGGCGGCCGT GGTCAACGAC GTCTCTGGCG GGCTCGCCGA CCCGGACATG GCCGGAGTCG TCCGGGACGC CGGCTGTCCC TGGGTGCTCA TGCACTGGCG TGGGCACTCC CGCACGATGC GCGAGCTGGC CCGCTACGGC GACGTCGTCA CCGACGTCCG GACCGAGCTG GCGCAGCGGG TCGAGGCGGC ACTCGCAGCT GGTGTGGCGG CCGATCGCAT CGTCATCGAC CCGGGGCTCG GCTTCGCGAA GACCGCCGCG CACAACTGGG AGCTGAGCGC CCGGCTGCCG GAACTGCTGA CGCTCGGCTA CCCGCTGCTC TTCGCGGCCA GCCGTAAGTC TCACCTGGGC GCGTTGCTCG CCGCACCGGA CGGCGTGCCG CGGCCTGTCG AGGGCCGGGA GATCGCCACG GTGGCCACCA GCGTGCTCGC GGTCGCGGCG GGTGCCTGGG GCGTGCGGGT ACACGACGTC CGAGCCACCA CGGATGCGCT GGCTGTCTGG CGGGCCACCG GGTCACCCCG GCTGGCCACC ACCGCCGGCA CCCTGGCCAA GGGAGGGAAG CGGTGA
|
Protein sequence | MTDLVQAASP VVMGILNVTP DSFSDGGRYA DLDAAIAHGV RLRDDGAHLV DVGGESTRPG ADRVDAETEA TRVLPVVREL TAAGVPVSID TTRARVAELA LAAGAAVVND VSGGLADPDM AGVVRDAGCP WVLMHWRGHS RTMRELARYG DVVTDVRTEL AQRVEAALAA GVAADRIVID PGLGFAKTAA HNWELSARLP ELLTLGYPLL FAASRKSHLG ALLAAPDGVP RPVEGREIAT VATSVLAVAA GAWGVRVHDV RATTDALAVW RATGSPRLAT TAGTLAKGGK R
|
| |