Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_12791 |
Symbol | cysQ |
ID | 5730799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1154006 |
End bp | 1154986 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 641285648 |
Product | CysQ-like protein |
Protein accession | YP_001551164 |
Protein GI | 159903820 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1218] 3'-Phosphoadenosine 5'-phosphosulfate (PAPS) 3'-phosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.386553 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0458292 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTATAGAT TCCCTGAGGT TGGTTCGCTC GATTTTCCTA AAGCTTCGTT TTCTAAAAAA TCCATGAGTT CTACTGATTG TTTTCTTCCG AATGGGGTCA ATCTTGAAGA TCTATTAGAA GGTCTTAGAA GTTTGAGTTG GGGAGCTGCT GATATTTTGA TGGCATATGC CAGGGGAAGT AAGCCTCCTT ATGGTTTTCC AATGGAATTG GAGATCGAAG ATAATCCTGG AGGACCGGTC TCTGCAGCTG ACTTGGCCGT TAATAGTTGG TTGCTTGATG GTATTAACTC TAAGTTTCCT ACTGCAACTT GGAAGTTATT GAGTGAAGAG AATGCAAAAG AAGAATTTGT TGAAGGACTT TCAGCTTGTA ATGGTTGGAT CTGGATCTTA GACCCTCTTG ACGGGACTAA GGATTTTATT AAAGGAACGG GAGAATACGC CGTGCATTTG GCACTTGTGA ATGATCATCA TTTAAAAATG GGGGTCGTTT TGATTCCAGA GAAGGAAGAA TTATGGTTCG GAGTTTTAGG GGAAGGCGCG TGGTGTGAGA ATCGGTTGGG AGAAAAAAGG AATGTGAAAT TTAGTAATAG AACGCAAATT TCGGAAATGA TTCTTGTAGC AAGTAAAAGT CATAGAGATA AAACACTGTC TCAATTAATG GAAAGGATCT CTCCTGGAGA GACTAAAGGT ATTGGAAGTG TTGGATGCAA AGTAGGGACC ATACTTAGAG GAGAAGCAGA TTTCTATATA TCTTTGTCAG GCAAAACAGC TCCTAAAGAT TGGGATATGG CAGCACCTGA GGCAGTCCTA AGGGCTGCAG GAGGAGGATT TACACACGCC GATGGTAGAC CCCTTTCATA CAACAAAGAT AACTATGAGC AACGAGGATG CTTGATTGTT AGCCATGGTA AAAACCATGA CCTTATATGT AAACTAGCTG AAGATGAAAT AAAAAAATTA GACCCTTTTT TTGAAATTTA A
|
Protein sequence | MYRFPEVGSL DFPKASFSKK SMSSTDCFLP NGVNLEDLLE GLRSLSWGAA DILMAYARGS KPPYGFPMEL EIEDNPGGPV SAADLAVNSW LLDGINSKFP TATWKLLSEE NAKEEFVEGL SACNGWIWIL DPLDGTKDFI KGTGEYAVHL ALVNDHHLKM GVVLIPEKEE LWFGVLGEGA WCENRLGEKR NVKFSNRTQI SEMILVASKS HRDKTLSQLM ERISPGETKG IGSVGCKVGT ILRGEADFYI SLSGKTAPKD WDMAAPEAVL RAAGGGFTHA DGRPLSYNKD NYEQRGCLIV SHGKNHDLIC KLAEDEIKKL DPFFEI
|
| |