Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CA2559_09051 |
Symbol | |
ID | 9297295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Croceibacter atlanticus HTCC2559 |
Kingdom | Bacteria |
Replicon accession | NC_014230 |
Strand | + |
Start bp | 1984340 |
End bp | 1987294 |
Gene Length | 2955 bp |
Protein Length | 984 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | thermolysin |
Protein accession | YP_003716557 |
Protein GI | 298208378 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTAA ATTACGCCTT AACTTTTGTT TTAGCCTTAG GAACTCTGGT TGCTAACGCA CAGGATAAAA ACAAATTAAG GGATAATGTG GCTCATCAAA CTATGATGAC AGTTAGTGGC CACCAGTCCC AGCAAGAAGC CATAGCTAAT TTTGCCAAAT CTTATGAACT TGATGAAAAC TCAACTTTTC AAGAGTACAG AACTACAACA GACAAGTTAG GAAATACTCA TAGTAGATAT CAGCAATATT TTAATGGTAT TAAGGTACAG TTTGGTGTAA TGATAGTTCA CAACAAACAA GGTAGCGTCT CTTTAATAAA TGGTGAATTA TACAATCCAA AAACTATTAA TACTGTACCG AGCCTCTCTA AAGAAGTAGG TTTACAACGT GCAATTGAGC ATACTAATGC TAACGCTTAC TTGTGGGAAG ACGCTTCACA AGCGGCTCTT ATGGGTTACT CAAAACCCGT AGGTGAGTTG GTAATATTTC CAGATGTAAA CAAAGGTATT GTTCATTTAG CGTATATGTA TGATGTATAT TCTACTTCAC CTATATCTCG TAATGAAGTT TATGTAGATG CACACTCTGG AGATATTTTA TTTAAGAACC CAATTATAAA ACACGCAGAC CGTTTAATTT CTAATAATGA AATTTCAGCA AAGGCAAAAG AACTTGAGAA TACGTTAAAC TCTGCGCTTG TTGAAGGAAC TGCAGATACT CGTTACAGTG GTACAAGGCC AATTGAAACA ACTCCTGAAG GATCTGAATT TATACTGTTA GATACAACAA GAGGTGACGG AATTGTTACT TACAATTGCG AAGGTTTTAA TGCATATCAA GATTTACATT TTTCTGATAA TGATAACGCT TGGACAGCTG CAGAATATGA TAACGATGAA AAAGATAATG GTGCATTAGA TGCCCATTGG GGAGCAGAAG TAACCTATGA TTTCTGGCAA GATATATTTA ACAGAAATAG TTTTGATGAT GAAGGAGCTC AAATTATAAG CTACGTACAT TACGATGATG CAGACGAACC TTTGTTTTTA GATACAGGTT ATGATAACGC CTTTTGGAAT GGATCTGTAA TGACATATGG AGACGGAAAT ACTTTTGATA TTCTTACAGC GGCAGATGTA TGCGGTCACG AAATAGGTCA TGCTGTATGT ACATTTACTG CAGATCTAGC TTATCAAAAT CAATCTGGAG GTATGAATGA AGGGTACTCA GACATTTGGG GTGCTTGTGT AGAGCATTTC GGAAGAACAG GTTCTATAGA TGGTGCTATT GATCCTAATG TGTGGTTAAT AGGTGAAGAC CTTAATTCAC CACCATTACG CTCTATGAGT GATCCTAATT CAAGAAATGA TCCAGATACT TATTTAGGAG ATAACTGGGT ATCTACTGGA GATGAAGGTA CTTGTGTGCC AGATGCAACA ACAAATGATT ATTGTGGTGT TCACACCAAT AGTGGTGTTT TAAATCACTG GTTTTATATC TTGACAGAAG GTGCTGCAGA TACAAATAAT GCGCCAACAC CAGATACTTA TGATGTTGCA GGAATAGGTA TGGTTAAATC TGCAGAGATT GCTTATTTGG CAGAACGTGA TTATTTAACT CCAAATGCAA CATATTTTGA TGCTCGTAAC GCAACTATTG CTGTAGCTAG CTCTATATAT TGCGCAAACA GTCCAGAAGT TATTTCTGTA ACTAATGCTT GGTATGCCGT AAACGTAGGC GAAGAATTTG CAGAAGCAGC AGATGATGTG TCGTTAATGT CTATTACAAA CAATACAGAA GTTGGGTGTG ACGCAGGTAC ATTTAGCCCT CAAATTGTAG TTTCTAATGG AGGTACAAAT ATGTTGGGAG ATGTAGACAT TTCTTTTTCA GTAAATGGTG CAGCTACTAC AACAGTTACT GAAACAGTTA ACTTGGGAAC TTGTGAGTCT ACTACTCTAA CTTTAGATAT TGGATCTTTA ACTCGTGGCG CAAATGTTTT AAACGTAGAT GTAACTACAA CAAACGATGG TCGTCCAGAA AACAATACAG GAACAGTATT AGTAGTTGTA AACGATGCTG CTGAAGTAGA TGTAGTAAAT ACTTTTGATA CTGTTGAAGA GTCTTTAATC TCTTATAATG ATGGTGTTAT TTCAAGCTTA TGGGAAAGAG GAGTAGCTCA AGGAACGCTT TTAAGTGATG CAGTTGCTGG AGGATCTGAT GTTTACGGAA CTAATTTAGA TGGAAACCAT CCAGATGGTA CTATAGCTTA TCTTGTTTCA CAATGTTATG ATTTATCTTC TGTAGAAAAT GCTATATTAA AATTTGATAT GGCTTTCGAT CTTGAAGAAA ATTGGGATAT TATGTATATG CAGTATACTA CAGATGGTGG TTCAACTTGG CAAACTTTAG GAACAGCATC AGATGCTAAT TGGTATAATA ACGATAGGCT TCCAGACGGT ACAGATTGCT TTAATTGTAT TGGTTCTCAG TGGACAGGTG AAGGAGAAGA CGCACACTCA GGTGGAGGTT TAAACGCTAC AATGCACGAA TATAGTTACA GCCTTAGTGC TTTTGATAAT GATGGTAGTG CCGAAACTAG TATGGTGTTT AGGTTTGTAT TTCAGTCTGA CGCTGCTGTA AATGAAGAAG GTGTAATAGT AGATAATTTT GTAGTTGAAG CAGACCAGAT TGTACTTGCT ACTACAAATA ATGAGTTTAA AGGATTGTCT ATATATCCTA ACCCTACAAA TACTATTGTA AATATTTCTG GACAAGATCT TAAAGATGCT AAAGTTTCAG TAATCGATTT AAGCGGAAGA GTAATTTCTA ACAATGCGTC TGTATTTAAT GGTAATGTAT TGCAAGTTAA TATGGAGCAG TTAGCTTCTG GTAGTTATTT CTTAGTTATT GAAAATGAAA ATTATAGATC TGTTAAGCAG GTTATTAGAA AATAA
|
Protein sequence | MKLNYALTFV LALGTLVANA QDKNKLRDNV AHQTMMTVSG HQSQQEAIAN FAKSYELDEN STFQEYRTTT DKLGNTHSRY QQYFNGIKVQ FGVMIVHNKQ GSVSLINGEL YNPKTINTVP SLSKEVGLQR AIEHTNANAY LWEDASQAAL MGYSKPVGEL VIFPDVNKGI VHLAYMYDVY STSPISRNEV YVDAHSGDIL FKNPIIKHAD RLISNNEISA KAKELENTLN SALVEGTADT RYSGTRPIET TPEGSEFILL DTTRGDGIVT YNCEGFNAYQ DLHFSDNDNA WTAAEYDNDE KDNGALDAHW GAEVTYDFWQ DIFNRNSFDD EGAQIISYVH YDDADEPLFL DTGYDNAFWN GSVMTYGDGN TFDILTAADV CGHEIGHAVC TFTADLAYQN QSGGMNEGYS DIWGACVEHF GRTGSIDGAI DPNVWLIGED LNSPPLRSMS DPNSRNDPDT YLGDNWVSTG DEGTCVPDAT TNDYCGVHTN SGVLNHWFYI LTEGAADTNN APTPDTYDVA GIGMVKSAEI AYLAERDYLT PNATYFDARN ATIAVASSIY CANSPEVISV TNAWYAVNVG EEFAEAADDV SLMSITNNTE VGCDAGTFSP QIVVSNGGTN MLGDVDISFS VNGAATTTVT ETVNLGTCES TTLTLDIGSL TRGANVLNVD VTTTNDGRPE NNTGTVLVVV NDAAEVDVVN TFDTVEESLI SYNDGVISSL WERGVAQGTL LSDAVAGGSD VYGTNLDGNH PDGTIAYLVS QCYDLSSVEN AILKFDMAFD LEENWDIMYM QYTTDGGSTW QTLGTASDAN WYNNDRLPDG TDCFNCIGSQ WTGEGEDAHS GGGLNATMHE YSYSLSAFDN DGSAETSMVF RFVFQSDAAV NEEGVIVDNF VVEADQIVLA TTNNEFKGLS IYPNPTNTIV NISGQDLKDA KVSVIDLSGR VISNNASVFN GNVLQVNMEQ LASGSYFLVI ENENYRSVKQ VIRK
|
| |