Gene Cyan7425_1449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan7425_1449 
Symbol 
ID7287371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 7425 
KingdomBacteria 
Replicon accessionNC_011884 
Strand
Start bp1286139 
End bp1287626 
Gene Length1488 bp 
Protein Length495 aa 
Translation table11 
GC content50% 
IMG OID643584449 
Productsulfatase 
Protein accessionYP_002482181 
Protein GI220906870 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.499056 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACA TATTTCGGAA ATCGAGAAGG TTTCTCTTCG GCTTCCTTTT TGCAGTTTTT 
ACCTGTTGTA TAACCTGGAA GTTGCTCACT CTAAATCAGC AGGATTTGCC CGTGGCGGTA
GCTCAGCAAT CCTCCCAACC TCCCCATATC CTCTTTATTA TGTCGGATGA TCAGGGATGG
AAAGATGTCG GCTTTCATGG TTCTGATATT CGCACCCCAA ACCTGGACCA ACTGGCTAAG
ACCGGCGCAC GGCTTGAACA ATACTATTCC CAGCCCATGT GTACGCCATC GCGCGCGGCC
CTGCTGACAG GCAGATATCC CCATCGTTAT GGCCTGCAGA CTTTAGTCAT TCCTTCCGCA
GGCAAATATG GTCTGCCTAC CGATGAATAT TTGCTGCCCC AAGCGCTTAA AGAGGCTGGA
TACGAGACTG CGATCGTGGG CAAATGGCAC CTCGGTCATG CCGATCCCAA ATACTGGCCC
CGCCAACGGG GATTTGATTA TCAGTATGGC CCACTTCTCG GTGAGATCGA CTACTTCACC
CATTCAGCTC ATGGCAAGGT TGATTGGTAT CGCAATAACC AGTTGATTAA GGAAGAGGGT
TACGTAACGA CATTGCTGGG TCAGGATGCG GTAAAACTGA TCGAAAAACA CAACCCCAAA
ACTCCGCTCT TTCTTTATCT AGCCTTTACT GCTCCCCACG CTCCCTATCA GGCTCCACAA
AAGTATCTTG ACCAATACAA AACCATCGCT GATCCTAACC GTCGCGCCTA TGCCGCCATG
ATTACAGCCA TGGATGACCA AATTGGCCAG GTTGTAGCTG CGTTGGAAAA GCGCGGGATG
CGTAACAACA CTCTTATTGT TTTTCAGAGT GATAACGGCG GCCCACGCTC AGCTCAGTTC
ACGGGGGAAG TTGATACTTC CGGGGGCACA ATTCCAGCCG ATAATGGTCC CTACCGGGAT
GGCAAAGCGT CGCTTTATGA AGGTGGTACC AGGGTGGTTG CGCTCGCTAA CTGGCCCGGA
AAGATTCAGC CAGGCACAGT GGTGAACCAT CCAATTCATA TCGTCGATAT GTACCCCACC
CTGACAGGAC TGGCTAGTGT TTCGGTCGGT AAAAATAAAC CACTTGATGG CTTGAACATC
TGGCCCGCCC TGAGTGAAGC TAAGCCCTCT CCGCGCAGTC AAGTTGTCTA CGATATTGAG
CCTTTTCGTG CAGCTCTCAG TCAGGAAGAT TGGAAATTGG TCTGGAAGGC AACTCTACCC
TCCCGTCTCG AACTCTTCAA TCTGTCCCAG GATGTTTCCG AGCAAACCAA CCTGGCCGAG
CAGAACCCAG AAATTGTGTC CAGGCTAAAA CAACAAATTG AAGTGCTTTC TCGCGATGCT
GTTCTCCCAC CCCTGTTCTT AAAGGAAGCT GTTGGTGCAG CAAAGAGTAT ACTGTTTACC
TCAGTTTCTA CGCCCGAAGA TTCAGCGGAA ATCGAAAAAC AGCCCTGA
 
Protein sequence
MKNIFRKSRR FLFGFLFAVF TCCITWKLLT LNQQDLPVAV AQQSSQPPHI LFIMSDDQGW 
KDVGFHGSDI RTPNLDQLAK TGARLEQYYS QPMCTPSRAA LLTGRYPHRY GLQTLVIPSA
GKYGLPTDEY LLPQALKEAG YETAIVGKWH LGHADPKYWP RQRGFDYQYG PLLGEIDYFT
HSAHGKVDWY RNNQLIKEEG YVTTLLGQDA VKLIEKHNPK TPLFLYLAFT APHAPYQAPQ
KYLDQYKTIA DPNRRAYAAM ITAMDDQIGQ VVAALEKRGM RNNTLIVFQS DNGGPRSAQF
TGEVDTSGGT IPADNGPYRD GKASLYEGGT RVVALANWPG KIQPGTVVNH PIHIVDMYPT
LTGLASVSVG KNKPLDGLNI WPALSEAKPS PRSQVVYDIE PFRAALSQED WKLVWKATLP
SRLELFNLSQ DVSEQTNLAE QNPEIVSRLK QQIEVLSRDA VLPPLFLKEA VGAAKSILFT
SVSTPEDSAE IEKQP