Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CA2559_12953 |
Symbol | |
ID | 9298081 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Croceibacter atlanticus HTCC2559 |
Kingdom | Bacteria |
Replicon accession | NC_014230 |
Strand | - |
Start bp | 2807184 |
End bp | 2808713 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | Sulfate permease family protein |
Protein accession | YP_003717332 |
Protein GI | 298209153 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.215491 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TCATCAATTT ATTCGATTTT TCACAGAAGG TAAATTATAA GACAGAAATT CTTTCAGGCT TAACAGTTGC TATGGCACTA GTTCCAGAAG CAATAGCGTT TGCCCTTATT GCAGGACTGT CTCCTCTAAC CGGACTATAC GCAGCATTTG TTCTTGGGTT AATAACCTCA ATTTTTGGAG GAAGACCAGG AATGATTTCT GGAGCTACTG GTGCAGTAGC AGTGGTTATA GTATCCTTGG TGCAATCTCA TGGTGTTGAG TATGTTTTTG CTACGGTAGT TTTGGCAGGT CTTATACAGG TACTTGCAGG CGTTTTTAAA TTAGGTAAAC TAATGCGATT AGTACCGCAT CCTGTAATTT TTGGATTTGT AAACGGTTTA GCCATAATCA TATTTATGTC TCAGCTTACG CAGTTTAAAG ATGCTAGCGG AGATTGGCTT ACAGGAACGT CTTTATATGT GTTACTTGGA TTGGTACTAC TTACAATGCT AATTATTTGG GGGCTTCCAA AGCTAAGTAA AGCTATTCCT GCTGCACTTG TAGCTATTTT GGTAGTCTTT GGTTTAGTGG TGGTGTTTGG GTTAGATACG CGCACTATTG GTGATATTGC ATCAATTGAA GGTGGTTTTC CGCCGTTTCA CTTACCTGCA GTTCCTTTTA ATATAGAAAC TTTAGCTATT ATATTTCCAT ATGCTGCTGT AGTTGCTGGT GTTGGACTTA TTGAAAGTCT ATTAACACTA AATATCGTTG ATGAAATTAC AGAAACTCGT GGTAGTGGTA ATAAAGAAGC CGTAGCGCAA GGTGCTGCAA ATATATTATC TGGTGTATTC TCAGGAATGG GTGGTTGTGC TATGATTGGA CAAAGTTTAA TTAACATTTC TAATGGTGCA AGGGCAAGAT TGTCTGGAAT TGTAGCGTCT GTGATGCTGC TTATATTTAT TATGTTTGGC GCTGGTCTTA TAGAGCGTTT GCCAATGGCA GCTCTTACAG GACTTATGAT TATGGTATCT ATAGGAACTT TTGAATGGGC CAGTTTAAGA ACATTTAGAC GTATGCCAAA ATCAGATATT TTCGTGATGG TATTAGTAAC CTTAGTTACT GTGTTTTTAC ACAACCTAGC ACTAGCTGTA GTTATTGGTG TTATTATTTC TGCACTAGTG TTTGCTTGGG ATAATGCAAA GCGTATTAGA GCTAGAAAGC ACATTGCTAA TGATGGTACA AAACATTATG AAATCTATGG ACCTCTGTTC TTCGGAAGCG TTCAAGCCTT TAATGATAAG TTTGACATTC TTAATGATCC TGAGCAAATT GTAATAGATT TTGCAGAGAG TAGAGTTGTA GATATGTCGG CTATTGAAGC TTTAAATAAA ATTACAGAGC GTTATCAAAA GGTAGGGAAA AAGGTACACC TTAAGCATCT TAGTGAAGAT TGTAGAACAC TTCTTGCAAA CGCAGAAGAT CTTATTGAAG TGAATGTGTT AGAAGATCCT ACGTATAAAG TCCTTACAGA TAAGGTTTAA
|
Protein sequence | MKKIINLFDF SQKVNYKTEI LSGLTVAMAL VPEAIAFALI AGLSPLTGLY AAFVLGLITS IFGGRPGMIS GATGAVAVVI VSLVQSHGVE YVFATVVLAG LIQVLAGVFK LGKLMRLVPH PVIFGFVNGL AIIIFMSQLT QFKDASGDWL TGTSLYVLLG LVLLTMLIIW GLPKLSKAIP AALVAILVVF GLVVVFGLDT RTIGDIASIE GGFPPFHLPA VPFNIETLAI IFPYAAVVAG VGLIESLLTL NIVDEITETR GSGNKEAVAQ GAANILSGVF SGMGGCAMIG QSLINISNGA RARLSGIVAS VMLLIFIMFG AGLIERLPMA ALTGLMIMVS IGTFEWASLR TFRRMPKSDI FVMVLVTLVT VFLHNLALAV VIGVIISALV FAWDNAKRIR ARKHIANDGT KHYEIYGPLF FGSVQAFNDK FDILNDPEQI VIDFAESRVV DMSAIEALNK ITERYQKVGK KVHLKHLSED CRTLLANAED LIEVNVLEDP TYKVLTDKV
|
| |