Gene PCC7424_1202 details

Gene Information

Locus tag: PCC7424_1202
Symbol:
ID: 7112331
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 7424
Kingdom: Bacteria
Replicon accession: NC_011729
Strand:
Start bp: 1313275
End bp: 1314918
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 43%
IMG OID: 643479468
Product: sulfatase
Protein accession: YP_002376520
Protein GI: 218438191
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
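
The length fields in this record agree with its coordinates. The Python sketch below checks that arithmetic. It assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record itself.

```python
# Values copied from the Gene Information fields.
start_bp = 1313275
end_bp = 1314918

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1644 547
```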


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.191965
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTAAAT ATGCTAATCG TCGGCGATCG CACAGACAGA TAGCACAAAG TCTATGGGCG 
ATCGCTTTAA CTATTACCTT TTTCCTTGTT GAAGGCTTCT TCTTTAGCAA TATAGCTTTA
GCACAAATCA ATAATGCACC CCTTAAGGCT GCACCCACAC CCCCTAAGAA ACCGAATATT
GTTGTCATTT GGGGGGATGA TATTGGTCAA AGCGATCTGA GTATCTTTAC TAAAGGAATG
ATGGGCTTTA AAACCCCCAA TATTGACCGG ATTGCCTCTG AAGGGATGCT TTTTACCGAT
TATTACGCGG AACAAAGTTG TACCGCCGGG CGTTCCTCTT TTATTACTGG ACAAAGCGTA
TTTCGGACAG GGTTAAGTAA AGTAGGGCTT CCTGGCGCTG ATATCGGCTT ACAGGAGGAA
GATCCCACTA TTGCCGAAAT GCTTAAACCA TTAGGCTATG CTACAGCCCA ATTCGGTAAA
AATCACCTGG GAGATAAAGA CGAGTTTTTA CCCACTAATC ATGGATTTGA TGAGTTTTAC
GGGAATTTAT ATCATTTAAA TGCAGAGGAA GAACCGGAAC TGCCTGACTA TCCTAAACCC
GACGAATTTC CCAATTTTCG TAAAAAGTAT GGGCCTCGTG GAGTCATTCA CAGTTTTGCC
GGTGGAACTA TTGAAGATAC AGGGCCTTTA ACTAAAAAAC GGATGGAAAC CATTGATGAT
GACATTGCTA ACCGTTCTGT TGAATATATC AAAAAACAAG CCGCAGCCGG TGAACCCTTC
TTTATGTGGA CGAACTTCAC CCATATGCAC TTTCGTACTC ATACTAAACC CGAAAGTTTA
GGACAAGCCG GACGTTGGCA GTCTCCTTAT CATGACACTA TGATTGACCA TGACAAAAAT
GTGGGTCAAA TTTTAGATGC TTTAGATGAA ACTGGACTGG GCGAAAATAC TATCGTTATT
TATGGGACAG ATAACGGCCC CCACATGAAC ACTTGGCCTG ATGCTGCCAT GACTCCCTTC
CGAGGGGAAA AAGATACGGG TTGGGAAGGT GCTTTCCGAG TTCCCTGTAT GGTACGCTGG
CCAGGTCATA TTGCGCCGGG GACTGTATCC AATGAGATTA TATCTAATCT GGATTGGATG
CCCACTTTAG TTGCTGCCGC CGGTGATCCT GATATTAAGG AAAAATTACT TAAAGGATAC
CGAGCCGGAC GTAGATCTTA TAAAGTACAC TTGGATGGTT ACAACTTTTT ACCCTATCTA
ACCGGTGAAA CCAGCGAAAG TCCTCGGCGA GAATATTTCT ACTTTTCTGA TGATGGGGAT
TTACTCGCTC TGCGTTACGA TAACTGGAAA CTCCATTTTG CTCAACAACG GAAGGAAGGA
ACTTTAGCCC TCTGGGGTGA ACCTTTTGTC AAAACTCGTA TTGCTTGGCT TTATAACCTC
CGGACTGACC CCTACGAACG AGCAACTCTT ACTTCAAATA CTTATTGGGA CTGGTATCTC
GATCATGTTT ATTTGTTGTT ACCTGCTCAA GATTTTGTGG CTGCTTTTCT AAACACTTTT
AAAGAGTATC CGCCCCGTCA GAAATCGGCT AGTTTTACCA TCGATCAGGT TTTAGAACTG
TTACAAACCC CCCCCAGTAG CTAA
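
The gene sequence is displayed in space-separated blocks of ten bases, so it needs cleaning before any computation. The following is a minimal Python sketch that strips the display spacing and computes a GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and whether the record's 43% figure was calculated in exactly this way is an assumption.

```python
def clean_sequence(text: str) -> str:
    """Remove display spaces and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First display line of the gene sequence, used here as example input.
first_line = "ATGATTAAAT ATGCTAATCG TCGGCGATCG CACAGACAGA TAGCACAAAG TCTATGGGCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full gene sequence through `clean_sequence` should give a string of 1644 bases, matching the Gene Length field.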
 
Protein sequence
MIKYANRRRS HRQIAQSLWA IALTITFFLV EGFFFSNIAL AQINNAPLKA APTPPKKPNI 
VVIWGDDIGQ SDLSIFTKGM MGFKTPNIDR IASEGMLFTD YYAEQSCTAG RSSFITGQSV
FRTGLSKVGL PGADIGLQEE DPTIAEMLKP LGYATAQFGK NHLGDKDEFL PTNHGFDEFY
GNLYHLNAEE EPELPDYPKP DEFPNFRKKY GPRGVIHSFA GGTIEDTGPL TKKRMETIDD
DIANRSVEYI KKQAAAGEPF FMWTNFTHMH FRTHTKPESL GQAGRWQSPY HDTMIDHDKN
VGQILDALDE TGLGENTIVI YGTDNGPHMN TWPDAAMTPF RGEKDTGWEG AFRVPCMVRW
PGHIAPGTVS NEIISNLDWM PTLVAAAGDP DIKEKLLKGY RAGRRSYKVH LDGYNFLPYL
TGETSESPRR EYFYFSDDGD LLALRYDNWK LHFAQQRKEG TLALWGEPFV KTRIAWLYNL
RTDPYERATL TSNTYWDWYL DHVYLLLPAQ DFVAAFLNTF KEYPPRQKSA SFTIDQVLEL
LQTPPSS
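
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The Python sketch below is one way to do that, not the method the database used. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in its permitted start codons; this sketch does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order (shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First display line of the gene sequence.
first_gene_line = "ATGATTAAAT ATGCTAATCG TCGGCGATCG CACAGACAGA TAGCACAAAG TCTATGGGCG"
print(translate(first_gene_line))  # MIKYANRRRSHRQIAQSLWA
```

The output matches the first twenty residues of the protein sequence, and translating the final line of the gene sequence gives LQTPPSS, the last seven residues.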