Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPS_2367 |
Symbol | |
ID | 3522074 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Colwellia psychrerythraea 34H |
Kingdom | Bacteria |
Replicon accession | NC_003910 |
Strand | - |
Start bp | 2464714 |
End bp | 2466390 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637284824 |
Product | sulfatase family protein |
Protein accession | YP_269085 |
Protein GI | 71281602 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.158836 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAGT TTAACTCTAT AAAGGTTGCT TTATGTTTAG CACTCGCTTT AAGCTCTGTA ACAAGTTTTG CAAAAGAACA ACGTCCCAAC ATACTATTAA TTGTCGCAGA AGACATGAGC GCAAAAGTAG GCGCATTTGG TGATACTGTA GCTAAAACGC CGGTATTAGA CGAATTAGCA AAGAGTAGCG TTCGTTACCC TAATACCTTT ACCACCGCAG GTGTTTGTGC CCCTAGCAGA ACATCGCTGA TTACCGGGGT TCATCAAATA ACGGTTGGTG GCCAGCATAT GAGAACTCGT TCGTTTAAAG CATCAAATTA CAGAGCTGTA CCCGCACCTG ACGTTAAAGC TTTTCCTGAG TTATTACGAA AAAGTGGTTA TTACACCTAT GTTTCATCAA AACTAGATTA TCAATTTAGT AATACTTCAC CGCACACAGG TCCATTTACC ATCTGGAATT ACGAAGGTAA AAAACCAACA TGGCGAGGGC GAGAAAAAGA CCAACCCTTC TTTGGTATGT ATCATTTAGA TATTACTCAT GAAAGTCAGT TGTTTCCCAA AAAATCAACG AAGAATAAAA AAAGTGGTTT AGTTAAAAAT TGGATTACAC CTGAACGGGT AGTTATTCCG GCTTATTACC CTGATACACA ACTTATTCGT GAGGGTATAG CACTGCATTA TAATAATATT CATGCCATGG ATACCCAGGT AGGTAAGTTA TTAGCAGAAT TAAAAAAAGA TGGCTTAAGC GATAACACCA TCGTTATTTG GACAACAGAT CATGGCGACT CTCTTCCTCG AGGTAAACGT GAAGTGTATG ACAGCGGTTT AAAGGTGCCC ATGATCATTC ATTGGCCAGA CAAGTATCGT CCAAGTAAAA CAGTAAATGG CAGCATTGAT AGCCAGCTAT TAAGTTTTGT TGATATTGCC CCATCAATTT TAGCTATGGC AAATATTAAT ACACCAGCCT ACATCCAAGG TAAGGCGCGT ATCCCAAATA ATAATGCCAC AAACAAGATA GCTAAACGCG AATACATTTA TGCGTCTAAA GACAGGCTGG ATGAGTTTCC TTTTAGAGAA AGAGCTGTGC GTAATAATAA GTTTAAATAC ATTAAAAACT ATTTGCCCAA TAAACCAGGC GCTACCCATT TAGCTTACCG CGATCAAATG ATATTAATGC AAGATTTATG GCGTGAATTT GAAGCGGGTA GAATGAATAA GCAACAAGCT TTCTGGTTTA ACAATCGACC AGGTGAAGAG CTGTACGACA TTATTAACGA TCCTGAAGAG GTAAATAATT TAGCCGAAAA AGTAGAATAT CAACAACAGC TCAACATTAT GCGTAATGCA TTAAAAGAAT GGCAGTCTCA TGTAGATGAT TTAAGTGATC GCCCAGAAAT AGAGCTTGCT AATGAGTTTT GGCCGAACGG TCAACAACCG ATTACAGCTA AGCCTAGTAT TTATTTAGAT GAGAGTGGCC TCATCGCCAT TAAAGGCAAT ACTCAAGGCT CATCTATTGG CTACCAAACT AGTAACTTAA ACAAAAAAGG TAAATGGATA ACGAGTAAAA TTCGCGTTTA TAATCAACCT TTCGCGGTTG AAAAAGGCAT GAAAATTATC GCTAAGGCCG TTCGTTATGG TTATAAAACC AGTGAAAAAA CTATCAGAAT CTTTTAG
|
Protein sequence | MKQFNSIKVA LCLALALSSV TSFAKEQRPN ILLIVAEDMS AKVGAFGDTV AKTPVLDELA KSSVRYPNTF TTAGVCAPSR TSLITGVHQI TVGGQHMRTR SFKASNYRAV PAPDVKAFPE LLRKSGYYTY VSSKLDYQFS NTSPHTGPFT IWNYEGKKPT WRGREKDQPF FGMYHLDITH ESQLFPKKST KNKKSGLVKN WITPERVVIP AYYPDTQLIR EGIALHYNNI HAMDTQVGKL LAELKKDGLS DNTIVIWTTD HGDSLPRGKR EVYDSGLKVP MIIHWPDKYR PSKTVNGSID SQLLSFVDIA PSILAMANIN TPAYIQGKAR IPNNNATNKI AKREYIYASK DRLDEFPFRE RAVRNNKFKY IKNYLPNKPG ATHLAYRDQM ILMQDLWREF EAGRMNKQQA FWFNNRPGEE LYDIINDPEE VNNLAEKVEY QQQLNIMRNA LKEWQSHVDD LSDRPEIELA NEFWPNGQQP ITAKPSIYLD ESGLIAIKGN TQGSSIGYQT SNLNKKGKWI TSKIRVYNQP FAVEKGMKII AKAVRYGYKT SEKTIRIF
|
| |