Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC7424_1202 |
Symbol | |
ID | 7112331 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7424 |
Kingdom | Bacteria |
Replicon accession | NC_011729 |
Strand | - |
Start bp | 1313275 |
End bp | 1314918 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643479468 |
Product | sulfatase |
Protein accession | YP_002376520 |
Protein GI | 218438191 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.191965 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTAAAT ATGCTAATCG TCGGCGATCG CACAGACAGA TAGCACAAAG TCTATGGGCG ATCGCTTTAA CTATTACCTT TTTCCTTGTT GAAGGCTTCT TCTTTAGCAA TATAGCTTTA GCACAAATCA ATAATGCACC CCTTAAGGCT GCACCCACAC CCCCTAAGAA ACCGAATATT GTTGTCATTT GGGGGGATGA TATTGGTCAA AGCGATCTGA GTATCTTTAC TAAAGGAATG ATGGGCTTTA AAACCCCCAA TATTGACCGG ATTGCCTCTG AAGGGATGCT TTTTACCGAT TATTACGCGG AACAAAGTTG TACCGCCGGG CGTTCCTCTT TTATTACTGG ACAAAGCGTA TTTCGGACAG GGTTAAGTAA AGTAGGGCTT CCTGGCGCTG ATATCGGCTT ACAGGAGGAA GATCCCACTA TTGCCGAAAT GCTTAAACCA TTAGGCTATG CTACAGCCCA ATTCGGTAAA AATCACCTGG GAGATAAAGA CGAGTTTTTA CCCACTAATC ATGGATTTGA TGAGTTTTAC GGGAATTTAT ATCATTTAAA TGCAGAGGAA GAACCGGAAC TGCCTGACTA TCCTAAACCC GACGAATTTC CCAATTTTCG TAAAAAGTAT GGGCCTCGTG GAGTCATTCA CAGTTTTGCC GGTGGAACTA TTGAAGATAC AGGGCCTTTA ACTAAAAAAC GGATGGAAAC CATTGATGAT GACATTGCTA ACCGTTCTGT TGAATATATC AAAAAACAAG CCGCAGCCGG TGAACCCTTC TTTATGTGGA CGAACTTCAC CCATATGCAC TTTCGTACTC ATACTAAACC CGAAAGTTTA GGACAAGCCG GACGTTGGCA GTCTCCTTAT CATGACACTA TGATTGACCA TGACAAAAAT GTGGGTCAAA TTTTAGATGC TTTAGATGAA ACTGGACTGG GCGAAAATAC TATCGTTATT TATGGGACAG ATAACGGCCC CCACATGAAC ACTTGGCCTG ATGCTGCCAT GACTCCCTTC CGAGGGGAAA AAGATACGGG TTGGGAAGGT GCTTTCCGAG TTCCCTGTAT GGTACGCTGG CCAGGTCATA TTGCGCCGGG GACTGTATCC AATGAGATTA TATCTAATCT GGATTGGATG CCCACTTTAG TTGCTGCCGC CGGTGATCCT GATATTAAGG AAAAATTACT TAAAGGATAC CGAGCCGGAC GTAGATCTTA TAAAGTACAC TTGGATGGTT ACAACTTTTT ACCCTATCTA ACCGGTGAAA CCAGCGAAAG TCCTCGGCGA GAATATTTCT ACTTTTCTGA TGATGGGGAT TTACTCGCTC TGCGTTACGA TAACTGGAAA CTCCATTTTG CTCAACAACG GAAGGAAGGA ACTTTAGCCC TCTGGGGTGA ACCTTTTGTC AAAACTCGTA TTGCTTGGCT TTATAACCTC CGGACTGACC CCTACGAACG AGCAACTCTT ACTTCAAATA CTTATTGGGA CTGGTATCTC GATCATGTTT ATTTGTTGTT ACCTGCTCAA GATTTTGTGG CTGCTTTTCT AAACACTTTT AAAGAGTATC CGCCCCGTCA GAAATCGGCT AGTTTTACCA TCGATCAGGT TTTAGAACTG TTACAAACCC CCCCCAGTAG CTAA
|
Protein sequence | MIKYANRRRS HRQIAQSLWA IALTITFFLV EGFFFSNIAL AQINNAPLKA APTPPKKPNI VVIWGDDIGQ SDLSIFTKGM MGFKTPNIDR IASEGMLFTD YYAEQSCTAG RSSFITGQSV FRTGLSKVGL PGADIGLQEE DPTIAEMLKP LGYATAQFGK NHLGDKDEFL PTNHGFDEFY GNLYHLNAEE EPELPDYPKP DEFPNFRKKY GPRGVIHSFA GGTIEDTGPL TKKRMETIDD DIANRSVEYI KKQAAAGEPF FMWTNFTHMH FRTHTKPESL GQAGRWQSPY HDTMIDHDKN VGQILDALDE TGLGENTIVI YGTDNGPHMN TWPDAAMTPF RGEKDTGWEG AFRVPCMVRW PGHIAPGTVS NEIISNLDWM PTLVAAAGDP DIKEKLLKGY RAGRRSYKVH LDGYNFLPYL TGETSESPRR EYFYFSDDGD LLALRYDNWK LHFAQQRKEG TLALWGEPFV KTRIAWLYNL RTDPYERATL TSNTYWDWYL DHVYLLLPAQ DFVAAFLNTF KEYPPRQKSA SFTIDQVLEL LQTPPSS
|
| |