Gene Cyan7425_4529 details


Gene Information

Locus tag: Cyan7425_4529
Symbol:
ID: 7290482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 7425
Kingdom: Bacteria
Replicon accession: NC_011884
Strand:
Start bp: 4602828
End bp: 4604552
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 50%
IMG OID: 643587501
Product: sulfatase
Protein accession: YP_002485200
Protein GI: 220909889
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
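
The gene and protein lengths in this record follow from the start and end coordinates. The short sketch below checks that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 4602828
end_bp = 4604552

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed 1725 bp and 574 aa.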


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.204344
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATAAAG ACAGTAATAG TAGCAGCCGG AGAGGCTTCC TTAAAAAAGC TGCTCTGATG 
GGGGGAACCT TATTGGGAGG TGGCACAACT GCTACAGCTA TCAGTCGAGC TGCCTCTGGT
CGATCGCCCT TAAAACAACC TAATATTTTG ATCGTCATTG CCGATCAACT TCGTTATCCC
GCCTGGTTTC CTAACCAAGC CCAACTGGAT CAAATTTTGC CCAATTTAGG GAGCCTGCGC
CGTCGGTCTG TAACGTTCAG AAATTATTAT GCAGCGGCAA CGGATTGCAC TCCAGCCCGC
TCCACCTTGC TGACCGGACT CTATACCCAT CAGACAGGGA TGTTTCTCAC CATTGTGGCA
GGAGGGAGCC CCCAACCCCC AGGTACACTG ACAGAACCCA CACTTGATCC AGGGTTTCCT
ACCTGGGGGG CAGCATTGAG AACCTTTGGT TATTCAACCT GGTGGTTTGG TAAATTTCAT
GAAAACAACT ACAATTCCGG CACTCTAGAA CCCTACGGTT TTTCCGGAGG AACCTGGCCC
GACCCCTATG GTTTCCCTGG TGAGGGAGAA AAAGGTGATC CCTGTATTAC GGATCAATTT
TTAGGCTGGC TTAAATCTCC TCAATCCGCT AATCAACCCT GGTGTACCAC GGTCAGTCTG
GTTCAACCCC ACGATATTGC CTATTATTTC AAGTTAACGG ATTGTCCCAA TGTGAACCAG
GGAACTAGCC TGCGGCTGAT CAAGCAACTT CCCGCCAACT TTGAGACTCC TGATAGCCTG
AAGACCAATA AGCCTTCGCT GCAATCTCTG TTTCTGCAAC AACAAATTAA AGGATTAGGG
CTCATCCCCT ACAACGGGCC AGGCTTTGAA CGGGAATGGT TGGAACTGCT GAATCTCTAC
CTGGTGTTTC AGAAAGAACT AGATGTCGAA GTTGGACGCA TTCTGCAGGC GATCGGCAAC
TCCCCCTATT CCGATAACAC TGTGATTATC TTCCTCTCAG ATCATGGCGA GTTAGGCGGT
TCCCACGGTC TCCGCGGTAA GGGTTCTTGT GTCTATGATG AGTCAATGCA GACCCCACTC
TATATCTATG ATCCGACTGG TCGATTTGCC AAATCCCCCT CTACAGAGCG TAAACAGTTG
GTTTCCAGCG TGGACATTAT GCCCTTATTG CTTACCCTGG CTAATAATGG CAATGCTCAA
GCCTGGTACA GCCGATACCC TTATCTGTCC AAGCGATTGA ATATCCTGCC CATTCTCCTC
AACCCAGCAG CTCGGGGACG GCAGTACGTT CTCCATACCA GTGATGAAGG CCCTATTGCC
GGGATTGATC CGACCGATCC CAGTACTCCC CCCAGTCATG TGATTGGCTA CCGCACCGCT
CGGGCAAAGT TGGGGCTATA TAGCCGCTGG AGAAACAATA GTTCCAATTT AGCAATTATC
GCCCGAGGTC AGGAAACTGA ATTGTATGAC TACACCCAAC AACAGCCCAG TCCAGGGCTT
TGGGAGATCA ACAATGTTGC CAGTACTAAC CCTTCTCTCT TAAAGCAGTA CTATGGGGCT
TTGCAACTGG CCATTACAAA TGAGTTACGG GCCCCTCTGC CCGGCCGCTT GAAATCGTTT
CAACGGCGTG CTTTTCATAG ATACTGGCAA TACATTAACC ACCCCTCCTC TTTTCGACCT
GCATGTTTTA ACCCTTCCCA AGCTCCCTGT CCTAGTCGTT CTTGA
 
Protein sequence
MHKDSNSSSR RGFLKKAALM GGTLLGGGTT ATAISRAASG RSPLKQPNIL IVIADQLRYP 
AWFPNQAQLD QILPNLGSLR RRSVTFRNYY AAATDCTPAR STLLTGLYTH QTGMFLTIVA
GGSPQPPGTL TEPTLDPGFP TWGAALRTFG YSTWWFGKFH ENNYNSGTLE PYGFSGGTWP
DPYGFPGEGE KGDPCITDQF LGWLKSPQSA NQPWCTTVSL VQPHDIAYYF KLTDCPNVNQ
GTSLRLIKQL PANFETPDSL KTNKPSLQSL FLQQQIKGLG LIPYNGPGFE REWLELLNLY
LVFQKELDVE VGRILQAIGN SPYSDNTVII FLSDHGELGG SHGLRGKGSC VYDESMQTPL
YIYDPTGRFA KSPSTERKQL VSSVDIMPLL LTLANNGNAQ AWYSRYPYLS KRLNILPILL
NPAARGRQYV LHTSDEGPIA GIDPTDPSTP PSHVIGYRTA RAKLGLYSRW RNNSSNLAII
ARGQETELYD YTQQQPSPGL WEINNVASTN PSLLKQYYGA LQLAITNELR APLPGRLKSF
QRRAFHRYWQ YINHPSSFRP ACFNPSQAPC PSRS
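
The protein sequence corresponds to a codon-by-codon translation of the gene sequence, and the GC content is the fraction of G and C bases in it. The sketch below shows one way to reproduce both from the blocked sequence text. It is an illustration, not the pipeline used to build this record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard set of assignments, which translation table 11 shares (table 11 differs only in which codons may act as start codons).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence above.
first_row = "ATGCATAAAG ACAGTAATAG TAGCAGCCGG AGAGGCTTCC TTAAAAAAGC TGCTCTGATG"
print(translate(first_row))  # MHKDSNSSSRRGFLKKAALM
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the listed protein and the 50% GC content.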