Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan7425_1449 |
Symbol | |
ID | 7287371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7425 |
Kingdom | Bacteria |
Replicon accession | NC_011884 |
Strand | - |
Start bp | 1286139 |
End bp | 1287626 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 643584449 |
Product | sulfatase |
Protein accession | YP_002482181 |
Protein GI | 220906870 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.499056 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACA TATTTCGGAA ATCGAGAAGG TTTCTCTTCG GCTTCCTTTT TGCAGTTTTT ACCTGTTGTA TAACCTGGAA GTTGCTCACT CTAAATCAGC AGGATTTGCC CGTGGCGGTA GCTCAGCAAT CCTCCCAACC TCCCCATATC CTCTTTATTA TGTCGGATGA TCAGGGATGG AAAGATGTCG GCTTTCATGG TTCTGATATT CGCACCCCAA ACCTGGACCA ACTGGCTAAG ACCGGCGCAC GGCTTGAACA ATACTATTCC CAGCCCATGT GTACGCCATC GCGCGCGGCC CTGCTGACAG GCAGATATCC CCATCGTTAT GGCCTGCAGA CTTTAGTCAT TCCTTCCGCA GGCAAATATG GTCTGCCTAC CGATGAATAT TTGCTGCCCC AAGCGCTTAA AGAGGCTGGA TACGAGACTG CGATCGTGGG CAAATGGCAC CTCGGTCATG CCGATCCCAA ATACTGGCCC CGCCAACGGG GATTTGATTA TCAGTATGGC CCACTTCTCG GTGAGATCGA CTACTTCACC CATTCAGCTC ATGGCAAGGT TGATTGGTAT CGCAATAACC AGTTGATTAA GGAAGAGGGT TACGTAACGA CATTGCTGGG TCAGGATGCG GTAAAACTGA TCGAAAAACA CAACCCCAAA ACTCCGCTCT TTCTTTATCT AGCCTTTACT GCTCCCCACG CTCCCTATCA GGCTCCACAA AAGTATCTTG ACCAATACAA AACCATCGCT GATCCTAACC GTCGCGCCTA TGCCGCCATG ATTACAGCCA TGGATGACCA AATTGGCCAG GTTGTAGCTG CGTTGGAAAA GCGCGGGATG CGTAACAACA CTCTTATTGT TTTTCAGAGT GATAACGGCG GCCCACGCTC AGCTCAGTTC ACGGGGGAAG TTGATACTTC CGGGGGCACA ATTCCAGCCG ATAATGGTCC CTACCGGGAT GGCAAAGCGT CGCTTTATGA AGGTGGTACC AGGGTGGTTG CGCTCGCTAA CTGGCCCGGA AAGATTCAGC CAGGCACAGT GGTGAACCAT CCAATTCATA TCGTCGATAT GTACCCCACC CTGACAGGAC TGGCTAGTGT TTCGGTCGGT AAAAATAAAC CACTTGATGG CTTGAACATC TGGCCCGCCC TGAGTGAAGC TAAGCCCTCT CCGCGCAGTC AAGTTGTCTA CGATATTGAG CCTTTTCGTG CAGCTCTCAG TCAGGAAGAT TGGAAATTGG TCTGGAAGGC AACTCTACCC TCCCGTCTCG AACTCTTCAA TCTGTCCCAG GATGTTTCCG AGCAAACCAA CCTGGCCGAG CAGAACCCAG AAATTGTGTC CAGGCTAAAA CAACAAATTG AAGTGCTTTC TCGCGATGCT GTTCTCCCAC CCCTGTTCTT AAAGGAAGCT GTTGGTGCAG CAAAGAGTAT ACTGTTTACC TCAGTTTCTA CGCCCGAAGA TTCAGCGGAA ATCGAAAAAC AGCCCTGA
|
Protein sequence | MKNIFRKSRR FLFGFLFAVF TCCITWKLLT LNQQDLPVAV AQQSSQPPHI LFIMSDDQGW KDVGFHGSDI RTPNLDQLAK TGARLEQYYS QPMCTPSRAA LLTGRYPHRY GLQTLVIPSA GKYGLPTDEY LLPQALKEAG YETAIVGKWH LGHADPKYWP RQRGFDYQYG PLLGEIDYFT HSAHGKVDWY RNNQLIKEEG YVTTLLGQDA VKLIEKHNPK TPLFLYLAFT APHAPYQAPQ KYLDQYKTIA DPNRRAYAAM ITAMDDQIGQ VVAALEKRGM RNNTLIVFQS DNGGPRSAQF TGEVDTSGGT IPADNGPYRD GKASLYEGGT RVVALANWPG KIQPGTVVNH PIHIVDMYPT LTGLASVSVG KNKPLDGLNI WPALSEAKPS PRSQVVYDIE PFRAALSQED WKLVWKATLP SRLELFNLSQ DVSEQTNLAE QNPEIVSRLK QQIEVLSRDA VLPPLFLKEA VGAAKSILFT SVSTPEDSAE IEKQP
|
| |