Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0865 |
Symbol | |
ID | 5705130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 968026 |
End bp | 969246 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641270384 |
Product | threonine dehydratase |
Protein accession | YP_001535774 |
Protein GI | 159036521 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1171] Threonine dehydratase |
TIGRFAM ID | [TIGR01127] threonine dehydratase, medium form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.708894 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0742081 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAAAC TGGTAGGCCT GGACGACGTA CGGGCCGCGC GGGAGTTGCT CGCCGGTGTC GTTCGGACCA CCCCGTTGGA ACCGTCCCGC CCGCTCAGCG GGGCGCTCGG CGGGCCGGTG TGGCTGAAGT GCGAGAACCT GCAGCGGGCC GGCTCCTACA AGGTGCGGGG GGCTTTCGTG CGGATCTCCC GGCTGTCGGC CGCGGAGCGC GCTGAAGGGG TGGTCGCGGC CAGCGCCGGC AACCACGCTC AGGGTGTGGC GCTGGCCGCC GGCCTGGTCG GCACGCACGC CACCGTGTTC ATGCCGGTCA ACGCGCCGTT GCCGAAGGTG GCGGCCACCA AGGGCTACGG CGCACAGGTC GAACTCGTCG GCAACACGGT CGACGAGTCG CTGGTGGCCG CGCAGACCTA CGCCGAGCGC ACCGGGGCGA CACTGATCCA CCCGTTCGAC CACCGCGACG TCGTCGCGGG GCAGGGCACG GTGGCGTTGG AGATCCTCGA ACAGTGCCCG CAGGTCCGCA CGATCGTCGC CGGCGTGGGC GGGGGCGGCC TGGTCTCCGG CATCGCGGTC GCGGCGAAGG CGCTGCGTCC GGACGTGCGC GTCGTCGGTG TGCAGGCGGC CACCGCGGCC GCCTTCCCGC CGTCGCTGGC CGCCGGTGAG CCGGTGCGGC TGCCGTCGTT CGGCACGATC GCGGACGGTA TCGCCGTCGG CCGTCCCGGG GAGCTGACCT TTCGCCATGT CCGCGCGTTG GTCGACGAGG TCGTGACGGT ACGCGAGGAG GACATCTCCC GCGCGCTGCT GATGTTGTTG GAGCGGGGTA AGCAGGTGGT TGAGCCGGCC GGTGCGGTGG GCGTGGCGGC GTTGCTGTCC GGCGCCGTCG ACGTCGAGGC ACCGGTGGTG GCGGTGCTCT CCGGCGGCAA CATCGACCCG CTGTTGATGC TGCGGGTGAT CGAGAACGGC CTCGCGGCGG CCGGGCGCTA CCTACGGGTC ACCGTCCGCT GTTCGGATCG ACCGGGCCAG CTCGCCTCGC TGCTCAGTGA GATCGCGGCG CAAGGGGCGA ACGTCGTGGA CGTCGGGCAC CAGCGTGCCA ACCCGCATCT GCGACTCGGC GAGGTCGAGG TGGCGCTGTC GGTGGAGACC CGGGGCACCG AACACTCGGA TAGGTTGATC GGCGTGCTAC GGGCCAGCGG CTACCAGGTG GTCTTCGCCG GTGAGGGGTG A
|
Protein sequence | MTKLVGLDDV RAARELLAGV VRTTPLEPSR PLSGALGGPV WLKCENLQRA GSYKVRGAFV RISRLSAAER AEGVVAASAG NHAQGVALAA GLVGTHATVF MPVNAPLPKV AATKGYGAQV ELVGNTVDES LVAAQTYAER TGATLIHPFD HRDVVAGQGT VALEILEQCP QVRTIVAGVG GGGLVSGIAV AAKALRPDVR VVGVQAATAA AFPPSLAAGE PVRLPSFGTI ADGIAVGRPG ELTFRHVRAL VDEVVTVREE DISRALLMLL ERGKQVVEPA GAVGVAALLS GAVDVEAPVV AVLSGGNIDP LLMLRVIENG LAAAGRYLRV TVRCSDRPGQ LASLLSEIAA QGANVVDVGH QRANPHLRLG EVEVALSVET RGTEHSDRLI GVLRASGYQV VFAGEG
|
| |