Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_2220 |
Symbol | |
ID | 5058683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | - |
Start bp | 2512374 |
End bp | 2513414 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640474483 |
Product | hypothetical protein |
Protein accession | YP_001159049 |
Protein GI | 145594752 |
COG category | [C] Energy production and conversion [G] Carbohydrate transport and metabolism |
COG ID | [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTCA TGTTCACCGT CGGGGGTAGC CAGTCCACCG TGTTCTGCAT CGCCCCGCTG GCGACCGCGG CCCGCAACGC CGGCCACGAG ATACTGCTGA CCGCCGACGA ACCGCTGCTG GAGACCGCGG CGGCGATCGG GTTGCCCGCG GTCCCCACTC CCCATCCGGG AGAGCCCGCG GCCCGGCTGC GGGCCCTGCT GGAGTTGGCG GGGCAGTGGC CGCCCGATGT GGTTGTCGGC GGCCTGTCAT TCGTCCCCGG CCTCGTCGCC GTCGAGACCC AGGCGCTGTA CGTGCGTCAC TACTGGGACA TCGCGCCGCT GCGGCCAGAG CCGGGAATCA GACCCGACCT GGAGCGACGT GGCCTGAGCG AGCCACCGGC GGCCGACCTG TTCATCGACG TCTGCCCGCC GGGCCTACGG CCGTCGCCCA CACCCGGCGC ACGGCCGATG CGCTGGATCC CGAAGAACCG GCAGCGTCGC CTCCAGCCGT GGATGTACAC CCGGCCCGCC GACCGCCCCC GGGTTCTGGT TACCGCGGGC ACGCGAAACC TCTTCCTCCA TTCACGCAGC AGCGTCATCC GGCACCTGGT TGACCAGCTC ATCGCGGCCG GCGCCGAGGT GGTGATCGCG GCACCGGAGC GAGCCGCCGC GGAGTTCGGC GCGCAGTTCC GCGATGTCCG CGTCGGCTGG GTCCCGCTGG ATGTCGTCGC CCCCACCTGC GATCTGGCGG TACACCACGG CGGCGCGGCC ACCACGATGA CAGTGATGAA CGCCGGCGCG CCGCATCTGG TCGTCCCCGA CAACGACTAC TCCCGGTCCG TCGCGAAGGC CCTCACCGCC GTCGGCGCGG GCCTGACCGC CGCCCCGGTG CCCCCGGAGG GTGATCGGGC GGCGGTCGAG GCGATCACCG CGAGTTGCCG GCAGATCCTT GCCGACCCCG GGTACGCCGA ACGCGCGCGC TTCCTGGCCG CCGAGATCGC CGGACTACCG ACGCCCGCCG AGGTCGTCCA CATGGTCGAG GAGTTGGCCG CGAGCCGGTA G
|
Protein sequence | MKFMFTVGGS QSTVFCIAPL ATAARNAGHE ILLTADEPLL ETAAAIGLPA VPTPHPGEPA ARLRALLELA GQWPPDVVVG GLSFVPGLVA VETQALYVRH YWDIAPLRPE PGIRPDLERR GLSEPPAADL FIDVCPPGLR PSPTPGARPM RWIPKNRQRR LQPWMYTRPA DRPRVLVTAG TRNLFLHSRS SVIRHLVDQL IAAGAEVVIA APERAAAEFG AQFRDVRVGW VPLDVVAPTC DLAVHHGGAA TTMTVMNAGA PHLVVPDNDY SRSVAKALTA VGAGLTAAPV PPEGDRAAVE AITASCRQIL ADPGYAERAR FLAAEIAGLP TPAEVVHMVE ELAASR
|
| |