Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Syncc9902_0597 |
Symbol | |
ID | 3742639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus sp. CC9902 |
Kingdom | Bacteria |
Replicon accession | NC_007513 |
Strand | - |
Start bp | 603319 |
End bp | 604524 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637770768 |
Product | putative arylsulfatase regulatory protein |
Protein accession | YP_376609 |
Protein GI | 78184174 |
COG category | [R] General function prediction only |
COG ID | [COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGTAATCG CAGCTTCAAA CGTTCGGCCC GACTTCAGTC AGTTCGGGCC CATTGGCCTG GTGGTGATCC AGTCGACGTC GTTGTGCAAT CTTGATTGCT CATATTGTTA TTTGCCAGAT CGACAAAAAA AGCGAGTTTT TGATCTCAAT TTAATTCCTC TTTTAGTTGA GAGAATTTTA GAAAGCCCAT ACGCAGGACC GGAATTCTCA CTTGTATGGC ATGCCGGTGA ACCCCTCACT CTGCCAACAA ATTGGTACGA CGATGCCACT ACATTGATCA ATCAATCTCT TGAACACTTT GGTGCTCAAG ATCTAGAGAT TGACCAACAT GTGCAAACCA ATGCAACATT GATCAACAAC GATTGGTGTG ATTGCTTTAG GCGCAATGAA ATTGTGGTAG GCATCAGTGT TGATGGACCT GAAGACATTC ATGATGCCCA TCGACGCTTC CGCAACGGGC GTGGATCCCA TGCCATGGCG ATGCGGGGAA TTGAAGCCTT ACATCGAAAT CAAGTGCCGT TCCACTGCAT CTCTGTGATC ACTGCAGATG CCATGGAACA ACCCGAGCGT ATGTACCGAT TTTATCGAGA CAATGGCATC AATGATGTGG GTTTCAATGT TGAAGAGAAG GAAGGAATCA ATACATCATC TTCAATGGCA GGCTCAAATA TGGAGGCTAA GTATAAAGAT TTTCTCCGAA CGTTTTGGCG ACTAAGCGAG CAAGACGGTT ATCCCGTTGT CTTACGTGAA TTTGAACAGG TGATTAGCCT CATACAGGGG GATCGCCGAA TGAAGCAGAA CGAACTGAAC CGCCCCTTTT CAATTTTGAG CGTTGACGCC CAAGGTGATT TTTCAACGTT CGATCCGGAA CTGCTTTCCG TCGCCAGCGA CCGCTACGGC ACCTTCAATC TTGGCAACCT TAAAACGCAC AGCCTCGAAG AATCAACGCG GACAGAGTCT TTTCAGCGTC TGCTTCAAGA CATGACCCAA GGGGTGGAGA CATGCCACAA GGGTTGCGAA CACTTTGGCT TGTGTGGAGG CGGGAATGGA AGCAACAAGT TTTGGGAGCA CGGCACCCTC GCCTCAAGTG AAACCAATGC CTGCCGCTTC GGCACCAAAA TCCCCGTGGA AGTGCTTCTC GAGCGGTTCG AAGAGAGCCC ACCCATTGAG GTCAACCGAA CAACCACGGC GTCCCGAAGT TCGTAG
|
Protein sequence | MVIAASNVRP DFSQFGPIGL VVIQSTSLCN LDCSYCYLPD RQKKRVFDLN LIPLLVERIL ESPYAGPEFS LVWHAGEPLT LPTNWYDDAT TLINQSLEHF GAQDLEIDQH VQTNATLINN DWCDCFRRNE IVVGISVDGP EDIHDAHRRF RNGRGSHAMA MRGIEALHRN QVPFHCISVI TADAMEQPER MYRFYRDNGI NDVGFNVEEK EGINTSSSMA GSNMEAKYKD FLRTFWRLSE QDGYPVVLRE FEQVISLIQG DRRMKQNELN RPFSILSVDA QGDFSTFDPE LLSVASDRYG TFNLGNLKTH SLEESTRTES FQRLLQDMTQ GVETCHKGCE HFGLCGGGNG SNKFWEHGTL ASSETNACRF GTKIPVEVLL ERFEESPPIE VNRTTTASRS S
|
| |