Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4467 |
Symbol | |
ID | 5708342 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5048597 |
End bp | 5049787 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641273883 |
Product | radical SAM domain-containing protein |
Protein accession | YP_001539232 |
Protein GI | 159039979 |
COG category | [H] Coenzyme transport and metabolism [R] General function prediction only |
COG ID | [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes |
TIGRFAM ID | [TIGR00423] radical SAM domain protein, CofH subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.82006 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGTGA GCCGGGAGAT CGACGACATC CTGCAACGCG GCGCGGACGG CGGGCGGATC ACGCCCGAGG AGGCCCTGCT GCTCTACACC GATGCGCCCT TCCACGCGCT GGGTGAGGCC GCCGACGTGG TCCGTCGGCG ACGGTATCCG GACAACATCG TCACCTACCT GATCGACCGC AACATCAACT ACACGAACGT CTGCGTGACG GCGTGCCGGT TCTGCGCCTT CTACCGCGCA CCCAAGCACC GAGAGGGTTG GACCCACTCG ACGGAGGAGA TCCTGCGTCG CTGTGGCGAG GCGGTCGGGC TGGGCGCCAC CCAGGTGATG CTGCAGGGTG GGCACCATCC CGACTACGGC GTGGAGTACT ACGAGGAGCT GTTCTCCTCG GTGAAGAGGG CGTACCCGCA GCTCGCCATC CACTCGATCG GCCCGAGCGA GATCCTGCAC ATGGCGAAGG TCTCCGGTGT GAGCCTGACC GAGGCCATCA CCCGGATCCA GGCTGCTGGC CTGGACTCGA TCGCCGGTGC CGGCGCCGAG ATGCTGCCGG CCCGGCCGAG GAAGGCGATC GCCCCGCTGA AGGAGTCCGG TGAGCGCTGG CTCGAGGTGA TGGAGCTGGC CCACCAGCAG GGCATCGAGT CGACCGCGAC GATGATGATG GGCACCGGTG AGACCGCCGC GGAGCGGATC GAGCACCTTC GGATGATCCG TGACGTGCAG GATCGCACGC GGGGTTTCCG GGCGTTCATC CCGTGGACCT ACCAGCCGGA GAACAACCAC CTCAAGGGCC GGACCCAGGC CACCACCCTG GAGTACCTGC GGCTGGTGGC GGTGTCCCGG CTGTTCTTCG AGACCGTGCC GCATCTCCAG GCGTCGTGGC TGACCACCGG CAAGGACGTC GGCCAGCTCG CCCTGCACAT GGGCGTCGAC GATCTGGGTT CGATCATGTT GGAGGAGAAC GTCATCTCCT CGGCCGGGGC CCGACACCGT TCGAACCTGC ATGAGCTGAT CGGGATGATC CGGTCGGCGG ACCGGATCCC CGCCCAACGG GACACCCACT ACCACCGGCT CGTCGTGCAC CGGACGCCCG CTGACGACCC CACGGACGAC CGGGTCGTGT CGCACTTCTC CTCGATCGCC CTGCCGGGTG GCGGCGCCGG GAAGGCGTTG CCACTGGTGG ACGCCGGCTG A
|
Protein sequence | MTVSREIDDI LQRGADGGRI TPEEALLLYT DAPFHALGEA ADVVRRRRYP DNIVTYLIDR NINYTNVCVT ACRFCAFYRA PKHREGWTHS TEEILRRCGE AVGLGATQVM LQGGHHPDYG VEYYEELFSS VKRAYPQLAI HSIGPSEILH MAKVSGVSLT EAITRIQAAG LDSIAGAGAE MLPARPRKAI APLKESGERW LEVMELAHQQ GIESTATMMM GTGETAAERI EHLRMIRDVQ DRTRGFRAFI PWTYQPENNH LKGRTQATTL EYLRLVAVSR LFFETVPHLQ ASWLTTGKDV GQLALHMGVD DLGSIMLEEN VISSAGARHR SNLHELIGMI RSADRIPAQR DTHYHRLVVH RTPADDPTDD RVVSHFSSIA LPGGGAGKAL PLVDAG
|
| |