Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CA2559_05875 |
Symbol | |
ID | 9296671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Croceibacter atlanticus HTCC2559 |
Kingdom | Bacteria |
Replicon accession | NC_014230 |
Strand | - |
Start bp | 1316462 |
End bp | 1319332 |
Gene Length | 2871 bp |
Protein Length | 956 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | |
Product | Peptidase S8 and S53, subtilisin, kexin, sedolisin |
Protein accession | YP_003715937 |
Protein GI | 298207758 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCTTAGCC TAAGCACAAC CCTGTTATCG CAAGAATATA AAAGCTTTTA TTTAGAACTA GCTTCTGGCA TTGGCGTGGA TTTACTCGAT GGGAGCCAAA TAGTTATACA ACCCGACCAA ACCATAAAAC TAGTTACAAA TCAAACACAT CCTATTTTTG ATTTCCTAAA TTCGAAACCT TTCTATTCTA TAGAAAAAGC ATTTCCAATG ACTTCACTTC CTCGACTTAA AAAAGTATAT ATAATATCTA TAAAAGATGA TACTTTAGTG ACTGACTTTG TTAATAACAC AGGAATCACT TACTTTAAAG AGATAGATGA TGATAATAAT ATATTAACTA GCACTACAAC AGCAGTTAAC AACACACAAA TTTCGATACC AAATGATTAC GAAGACTTAC TTTATGGCGG CAATAATGCT TCTTTAGAAT TAATTAATGC GCCATTAGCT TGGGCAATTA CAACTGGTGA TTCGAACATT TTGGTAGGAG TAGTAGACTC ATCCTACGAG TTAAATCATC CCGATTTGAC CGGGCAAATA GTTAACAATA TTGAAGTCGT GCCAAGTAAT TTTTCTCACG GAACAGCTGT AGCGAGCATC ATAGCAGCAA ATACCGATAA CAATGAAGGA CTATCAAGTC TTGGTAGAGA TTTAAAGTTA GTAACTGCTA CCTGTGGAGG TGGAGGTAAC AATCTAGTTT CGGGTTTGGA AGAATTGGCA GATTATCCTG GTGTAAGAGT AATTAACTGT AGTTGGCGTA TTTGCGAAGG CTCTCCTGTC AAAACTTATC TGGACGATGT TATGGATTAT GTCCGTACAA AAGACATTTT AGTTGTAGCA AGTGCAGGCA ATGGTGTTCA GGGAGATGGT ACTGTAAAGG GTTGCGATTC AGACGGCAAT GGTTATGGAT ATCCTGGCTC CTATGATGAT GCGATTTCTG TTACAGGCGT TGGCCATAGA TTTTCTATAG GCACTTTACA GGACAATATT CCTGGCCAAA CTAAATCTTG GATTGACGTT CATAGGCAGC TACCTGACAA CCCTAACAAT TTTGCATCTC ATACGCACAA TGACAAAGTC AATGTTTCAG CTCCTGGTAT TTACGTGCTA TCTGCCACGG AAACATCCAA CAATCAAACT GGATATACTG CAGGTATCGG TACGTCGTCA TCAACGCCAC TTGCTTCAGC ACTGGCTGGA TTAATTTTCT CCATTAATCC AAATTTCACA GCAACACAAG TTAAAGATAT TATTGAATCT ACGACAGATA ACATATATGA CATAGATTTA AATCAACCCT ATATTGGCGA ATTAGGAACA GGTCGCATCA ATGCCTATGC TGCTGTTTTA AAAGCGCAAT GTTTAACCAA CCCTGAAAGC GGACTCGATC TTGCAATGCA AAATTCTCAA CTCGACAATT TTGAAGAACC CGACACCTTA ACAACACAAC CATACCGAAG CAAGGATATA TGGGTACGCA ATCAAGATGA TGGATTATTA GTAAAGGAGC ATCAGAATCC TGAATATGAC TCTAACAACC CCAATTACGT TTACGTAAGA GTAACCAATA GCAGTTGCGA TAGCTCTACA GGATCAGATC CCTTAAAGCT CTATTGGGCT AAAGCCAACA CAGCTTTATC TTGGCCAGAC CATTGGGATG GCAGTTTAAC AATGACAGAT CCTATTACAG GAGAAGATAT TCTTATGGGT GACCAGATTG GAACAGTTAA CATTCCACCT TTAGATATTG GGCAGGAAGC CATATTGACT TTTGAATGGC TAGTGCCTAA TCCCGAAGAT TATGAAAATA TCAACCCCAA TCCCTGGCAT TTTTGCCTTT TGGCTCGCAT AGATACGCCT AACGACCCTA TGACTACCCC TGAAAACGAC AATATTGTCT TGAATGTGTG GAACAACAAT AATATTGTAT GGAAAAACAC TACTGTTGTA GATGTCGTTG AAAATAGTTC TAATTATGGT GGTGTAGTCG CTGTGGGTAA TTACAGTAAT CAACAACAAT CATTTGATCT CGTTTTTGTT AAAGATGCCA ACGAAAAAGG GAAAGCTATT TATGAAGAAG CAGAAGTAGG TTTAGAAATG GACACCACCT TATATAATGC TTGGCAAGAT GGTGGAAAAA GCCAATCGCA ACTAAAATCT ACCAAAGATG ATTATAAGAA AATTCTCACC ACCAACAATT CTAAACTAGG CAACATTATA TTAGACGCTG GAGAAATTGG GACGTTATAC GTTTCCTTCA ACTTTTTTAC TAACCAATCT ACATCAAAGG ATAAATTTGC ATTAAGTGTT TTGCAACAGG ATGCAACTAC AGGTTCTCTT ATTGGTGGTG AAGAATACAT TATAAACAAA AAAACCAGCA GTAGTTTTAG GGCGAATGCC GGTGAAGATC AAGATATAAA AAAGGATGAG TTTATAGTAC TTAGTGCTAC ACCCATAAAT GAGGATGTCA CTTTCAATTG GTATGACCAA AATAATACAC TCCTCTATTC TGGTGAAAAT GTATTGGTTA ATCCCGAATT CACTTCAACC TATAAATTAG AAGTGATTTC AAACCTCAAC GGATTTAAAG ATTATGACCA TGTAGACATT ACTGTTAATC CTTTTTATCT CGAAAGTCTA ATACCCAATC CTGCAAACAA CCTAGTTACA GTGCAATATA AAGTCGATGA GGCAAGTTCT GCTTACCTAT CAGTAGTTAA TACTGCAACA GGACAACATC ACAACTATAT ACTCGATACC ACTCAAACCA GTAGACTTTT AGATATTTCT AATTTGAGTT CGGGGATTTA CAGTATAATA CTAGTGTGTG ATGGCGAAAT TCAAGACTCA TTAAATCTTT CAAAACTATA A
|
Protein sequence | MLSLSTTLLS QEYKSFYLEL ASGIGVDLLD GSQIVIQPDQ TIKLVTNQTH PIFDFLNSKP FYSIEKAFPM TSLPRLKKVY IISIKDDTLV TDFVNNTGIT YFKEIDDDNN ILTSTTTAVN NTQISIPNDY EDLLYGGNNA SLELINAPLA WAITTGDSNI LVGVVDSSYE LNHPDLTGQI VNNIEVVPSN FSHGTAVASI IAANTDNNEG LSSLGRDLKL VTATCGGGGN NLVSGLEELA DYPGVRVINC SWRICEGSPV KTYLDDVMDY VRTKDILVVA SAGNGVQGDG TVKGCDSDGN GYGYPGSYDD AISVTGVGHR FSIGTLQDNI PGQTKSWIDV HRQLPDNPNN FASHTHNDKV NVSAPGIYVL SATETSNNQT GYTAGIGTSS STPLASALAG LIFSINPNFT ATQVKDIIES TTDNIYDIDL NQPYIGELGT GRINAYAAVL KAQCLTNPES GLDLAMQNSQ LDNFEEPDTL TTQPYRSKDI WVRNQDDGLL VKEHQNPEYD SNNPNYVYVR VTNSSCDSST GSDPLKLYWA KANTALSWPD HWDGSLTMTD PITGEDILMG DQIGTVNIPP LDIGQEAILT FEWLVPNPED YENINPNPWH FCLLARIDTP NDPMTTPEND NIVLNVWNNN NIVWKNTTVV DVVENSSNYG GVVAVGNYSN QQQSFDLVFV KDANEKGKAI YEEAEVGLEM DTTLYNAWQD GGKSQSQLKS TKDDYKKILT TNNSKLGNII LDAGEIGTLY VSFNFFTNQS TSKDKFALSV LQQDATTGSL IGGEEYIINK KTSSSFRANA GEDQDIKKDE FIVLSATPIN EDVTFNWYDQ NNTLLYSGEN VLVNPEFTST YKLEVISNLN GFKDYDHVDI TVNPFYLESL IPNPANNLVT VQYKVDEASS AYLSVVNTAT GQHHNYILDT TQTSRLLDIS NLSSGIYSII LVCDGEIQDS LNLSKL
|
| |