Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0540 |
Symbol | |
ID | 5705640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 611016 |
End bp | 612245 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641270066 |
Product | hypothetical protein |
Protein accession | YP_001535460 |
Protein GI | 159036207 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3483] Tryptophan 2,3-dioxygenase (vermilion) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00210127 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGTGCTTCG TGCGCGAGTT GAACTGTTGG TTGTCTGGCA CCACTGATCC TGCCGATTTT CCGTACCTCG CGGTACTGCG CGAATTTCAC GAAGTCGGCA AGCACTTCGT CGAGAAGGAG ACACTCTCAC TGCTCGACGA GAGTCGCGGC AGGGTGACGG GTCACCCTGC CGCAGGCCAC GACGACCCGG CGCGGTTGTT GCGCGACTTT CTCGACGTCG CGCTCGACAA ATGGGATGGC CGCTACGACT ACCGCAGCTA CCTGGCACTG CGCCTGATCG GGCTGTCCGG CGAGGCGGAG GAACCCACAT TCGGCGGGGA CGACGCCGCC AGGCGTCTGC TCCGTGACCG CCTTGTGGTC TGCCTGGTCG CCGACGTCCT GAATTTCGAA CTGGCTGCTG CCGCGCACGC CACCACTCTG CTTCCGCGGC AGCGCCCGGG GCTGACCGTG GTGGCCAAAC GCTGCCGGCT CGGGGTCCGG GCCGCCCTTC CCGCGCTGGC CCGGCTCGGC CTGACCGGGC TGGTGCAGGA GGGCGAGCCG ACGTCCGCCG CCGCCACGCT GCACGCCGTC GCCGTCGACC TCGACTCCGT CGGTGCCCTG CCGCTGCGAC TGAGCATGCT GCCGGTCCAC GTGACCCACG ACGAGTACCT GTTCCTCCGG GTACTTCAGG CGTACGAGTG CGTCTTCGCC GGTGTCGCCG ACGAACTGCG TGCCGTCATC GCCGCGATCG GCGTCGACGA CGCTCGTGCG GCAGCCGACC GGTTGGAATA CGCCCGGAAC CTGATCCTCA ACGCCGGTCC ACTCTTCTCA TTGCTGGCCA CCATGCAGCC GAAGTCGTTT CAGACATTCC GGCAGTACAC CGAAGGAGCC AGCGCCATCC AGTCACGGTC GTACAAGCTC GTCGAGTCGT TGTGCCGTGG GCCGGATCAG GACCGGCTCG ACTCGGCCGC GTATGCGGCG GTGCCGGAAC TGCGGGCCCT GGTCCGGGCT GGTCAGCCGA CGATCGACGA CGCGTACCGG TCGGCCGTGC GGGATGGGCG ACTCGTCGGC GCGGACCGCG ACCTGATCAC CCGACGGATG GAGTTGTTCG CCGAGACGCT GCTTCAGTGG CGACGCACCC ACCACCGGAT CGCGGTCCGG ATGCTCGGTC CCCGACCCGG CACCGGCTAC ACGGAGGGCA CACCGTACCT GGCGGCGGTC CGTGCCCTGC CGGTCTTCTT CACCGCCTGA
|
Protein sequence | MCFVRELNCW LSGTTDPADF PYLAVLREFH EVGKHFVEKE TLSLLDESRG RVTGHPAAGH DDPARLLRDF LDVALDKWDG RYDYRSYLAL RLIGLSGEAE EPTFGGDDAA RRLLRDRLVV CLVADVLNFE LAAAAHATTL LPRQRPGLTV VAKRCRLGVR AALPALARLG LTGLVQEGEP TSAAATLHAV AVDLDSVGAL PLRLSMLPVH VTHDEYLFLR VLQAYECVFA GVADELRAVI AAIGVDDARA AADRLEYARN LILNAGPLFS LLATMQPKSF QTFRQYTEGA SAIQSRSYKL VESLCRGPDQ DRLDSAAYAA VPELRALVRA GQPTIDDAYR SAVRDGRLVG ADRDLITRRM ELFAETLLQW RRTHHRIAVR MLGPRPGTGY TEGTPYLAAV RALPVFFTA
|
| |