Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1077 |
Symbol | |
ID | 5704345 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1208621 |
End bp | 1209634 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641270592 |
Product | short chain dehydrogenase |
Protein accession | YP_001535976 |
Protein GI | 159036723 |
COG category | [R] General function prediction only |
COG ID | [COG0300] Short-chain dehydrogenases of various substrate specificities |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00107266 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGGGCGCG GGCTACGGGA GATTCAGGTA GCGGTGGTCA CGGGTGCCAG CGGCGGCGTC GGGCGGGCCA CCGTACGGCA GCTGGCGCGG CCGGGGATCG CGATCGCCCT GTTGGCCCGC GGTCGCACCG GCCTCGACGC CGCGGCCGAG GACGTCCGCT CCGCCGGTGG CCACGCCATG CCGATCGAGG TGGACATGGC CGACTTCGAC CAGGTCGCCG CCGCCGGTCA GCGCGTCGAG GACGAACTGG GGCCGATCGA CCTGTGGATC AACGTGGCCT TCAGTTCGAT CTTCGCACCC TTCATGCAGA TTCGGCCCGA GGAGTTCCGC CGCACCGCTG AGGTCTCATA CCTCGGTTAC GTCTACGGGA CACGGGTGGC GTTGGATCAC ATGACCCGGC GCGATCGGGG CACCATCGTG CAGGTCGGGT CGGCCTTGGC CTACCGGGGA ATTCCCCTGC AGTCCGCCTA CTGCGGGGCC AAGCACGCCA TCGTGGGGTT CACCGAGTCA CTGCGCTGTG AGTTGCTGCA CGACAAGAGC AACGTCAAGG TCACCATGGT GCACCTGCCC GCGATGAACA CACCACAGTT CTCGTGGCTG CTGTCCCGGC TGCCACGGCA CGCCCAGCCG GTCCCGCCCA TCTACGAGCC GGAGGTCGCC GCCCGTTCCA TCGTCGCCGC CGCGGCCCGG CCCGGCCGGC GAGCGTACTG GGTGGGTACC CCCACGGCGC TGACCATCGT GGGTAACCGT CTGGTTCCGG GTCTGCTGGA TCGCTACCTG GGCCGGACCG GCTACCGCTC GCAGCAGACC GACCAGCCCG TCGACCCGGA CCAGCCGGCG AACCTGTGGC AGCCGGTCGA CGGACCGGGC GGCCACGACC ACGGCGCGCA CGGCGCGTTC ACCGACCGGT CACTGCGGCA CAGCCCGCAG GCGTGGCTGT CCCGGCACCG GATGGTCTCG GTAGCGGGAG TGGCCGGGCT GTTGTTCGGC GTTCTCGCCT GGCGTCGACA CTGA
|
Protein sequence | MGRGLREIQV AVVTGASGGV GRATVRQLAR PGIAIALLAR GRTGLDAAAE DVRSAGGHAM PIEVDMADFD QVAAAGQRVE DELGPIDLWI NVAFSSIFAP FMQIRPEEFR RTAEVSYLGY VYGTRVALDH MTRRDRGTIV QVGSALAYRG IPLQSAYCGA KHAIVGFTES LRCELLHDKS NVKVTMVHLP AMNTPQFSWL LSRLPRHAQP VPPIYEPEVA ARSIVAAAAR PGRRAYWVGT PTALTIVGNR LVPGLLDRYL GRTGYRSQQT DQPVDPDQPA NLWQPVDGPG GHDHGAHGAF TDRSLRHSPQ AWLSRHRMVS VAGVAGLLFG VLAWRRH
|
| |