Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan7425_4529 |
Symbol | |
ID | 7290482 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7425 |
Kingdom | Bacteria |
Replicon accession | NC_011884 |
Strand | + |
Start bp | 4602828 |
End bp | 4604552 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643587501 |
Product | sulfatase |
Protein accession | YP_002485200 |
Protein GI | 220909889 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.204344 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATAAAG ACAGTAATAG TAGCAGCCGG AGAGGCTTCC TTAAAAAAGC TGCTCTGATG GGGGGAACCT TATTGGGAGG TGGCACAACT GCTACAGCTA TCAGTCGAGC TGCCTCTGGT CGATCGCCCT TAAAACAACC TAATATTTTG ATCGTCATTG CCGATCAACT TCGTTATCCC GCCTGGTTTC CTAACCAAGC CCAACTGGAT CAAATTTTGC CCAATTTAGG GAGCCTGCGC CGTCGGTCTG TAACGTTCAG AAATTATTAT GCAGCGGCAA CGGATTGCAC TCCAGCCCGC TCCACCTTGC TGACCGGACT CTATACCCAT CAGACAGGGA TGTTTCTCAC CATTGTGGCA GGAGGGAGCC CCCAACCCCC AGGTACACTG ACAGAACCCA CACTTGATCC AGGGTTTCCT ACCTGGGGGG CAGCATTGAG AACCTTTGGT TATTCAACCT GGTGGTTTGG TAAATTTCAT GAAAACAACT ACAATTCCGG CACTCTAGAA CCCTACGGTT TTTCCGGAGG AACCTGGCCC GACCCCTATG GTTTCCCTGG TGAGGGAGAA AAAGGTGATC CCTGTATTAC GGATCAATTT TTAGGCTGGC TTAAATCTCC TCAATCCGCT AATCAACCCT GGTGTACCAC GGTCAGTCTG GTTCAACCCC ACGATATTGC CTATTATTTC AAGTTAACGG ATTGTCCCAA TGTGAACCAG GGAACTAGCC TGCGGCTGAT CAAGCAACTT CCCGCCAACT TTGAGACTCC TGATAGCCTG AAGACCAATA AGCCTTCGCT GCAATCTCTG TTTCTGCAAC AACAAATTAA AGGATTAGGG CTCATCCCCT ACAACGGGCC AGGCTTTGAA CGGGAATGGT TGGAACTGCT GAATCTCTAC CTGGTGTTTC AGAAAGAACT AGATGTCGAA GTTGGACGCA TTCTGCAGGC GATCGGCAAC TCCCCCTATT CCGATAACAC TGTGATTATC TTCCTCTCAG ATCATGGCGA GTTAGGCGGT TCCCACGGTC TCCGCGGTAA GGGTTCTTGT GTCTATGATG AGTCAATGCA GACCCCACTC TATATCTATG ATCCGACTGG TCGATTTGCC AAATCCCCCT CTACAGAGCG TAAACAGTTG GTTTCCAGCG TGGACATTAT GCCCTTATTG CTTACCCTGG CTAATAATGG CAATGCTCAA GCCTGGTACA GCCGATACCC TTATCTGTCC AAGCGATTGA ATATCCTGCC CATTCTCCTC AACCCAGCAG CTCGGGGACG GCAGTACGTT CTCCATACCA GTGATGAAGG CCCTATTGCC GGGATTGATC CGACCGATCC CAGTACTCCC CCCAGTCATG TGATTGGCTA CCGCACCGCT CGGGCAAAGT TGGGGCTATA TAGCCGCTGG AGAAACAATA GTTCCAATTT AGCAATTATC GCCCGAGGTC AGGAAACTGA ATTGTATGAC TACACCCAAC AACAGCCCAG TCCAGGGCTT TGGGAGATCA ACAATGTTGC CAGTACTAAC CCTTCTCTCT TAAAGCAGTA CTATGGGGCT TTGCAACTGG CCATTACAAA TGAGTTACGG GCCCCTCTGC CCGGCCGCTT GAAATCGTTT CAACGGCGTG CTTTTCATAG ATACTGGCAA TACATTAACC ACCCCTCCTC TTTTCGACCT GCATGTTTTA ACCCTTCCCA AGCTCCCTGT CCTAGTCGTT CTTGA
|
Protein sequence | MHKDSNSSSR RGFLKKAALM GGTLLGGGTT ATAISRAASG RSPLKQPNIL IVIADQLRYP AWFPNQAQLD QILPNLGSLR RRSVTFRNYY AAATDCTPAR STLLTGLYTH QTGMFLTIVA GGSPQPPGTL TEPTLDPGFP TWGAALRTFG YSTWWFGKFH ENNYNSGTLE PYGFSGGTWP DPYGFPGEGE KGDPCITDQF LGWLKSPQSA NQPWCTTVSL VQPHDIAYYF KLTDCPNVNQ GTSLRLIKQL PANFETPDSL KTNKPSLQSL FLQQQIKGLG LIPYNGPGFE REWLELLNLY LVFQKELDVE VGRILQAIGN SPYSDNTVII FLSDHGELGG SHGLRGKGSC VYDESMQTPL YIYDPTGRFA KSPSTERKQL VSSVDIMPLL LTLANNGNAQ AWYSRYPYLS KRLNILPILL NPAARGRQYV LHTSDEGPIA GIDPTDPSTP PSHVIGYRTA RAKLGLYSRW RNNSSNLAII ARGQETELYD YTQQQPSPGL WEINNVASTN PSLLKQYYGA LQLAITNELR APLPGRLKSF QRRAFHRYWQ YINHPSSFRP ACFNPSQAPC PSRS
|
| |