Gene CA2559_09051 details

Gene Information

Locus tag: CA2559_09051
Symbol:
ID: 9297295
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Croceibacter atlanticus HTCC2559
Kingdom: Bacteria
Replicon accession: NC_014230
Strand:
Start bp: 1984340
End bp: 1987294
Gene Length: 2955 bp
Protein Length: 984 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: thermolysin
Protein accession: YP_003716557
Protein GI: 298208378
COG category:
COG ID:
TIGRFAM ID:
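
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length_bp(1984340, 1987294)
print(length, "bp")                      # 2955 bp
print(protein_length_aa(length), "aa")   # 984 aa
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields of this record.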


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTAA ATTACGCCTT AACTTTTGTT TTAGCCTTAG GAACTCTGGT TGCTAACGCA 
CAGGATAAAA ACAAATTAAG GGATAATGTG GCTCATCAAA CTATGATGAC AGTTAGTGGC
CACCAGTCCC AGCAAGAAGC CATAGCTAAT TTTGCCAAAT CTTATGAACT TGATGAAAAC
TCAACTTTTC AAGAGTACAG AACTACAACA GACAAGTTAG GAAATACTCA TAGTAGATAT
CAGCAATATT TTAATGGTAT TAAGGTACAG TTTGGTGTAA TGATAGTTCA CAACAAACAA
GGTAGCGTCT CTTTAATAAA TGGTGAATTA TACAATCCAA AAACTATTAA TACTGTACCG
AGCCTCTCTA AAGAAGTAGG TTTACAACGT GCAATTGAGC ATACTAATGC TAACGCTTAC
TTGTGGGAAG ACGCTTCACA AGCGGCTCTT ATGGGTTACT CAAAACCCGT AGGTGAGTTG
GTAATATTTC CAGATGTAAA CAAAGGTATT GTTCATTTAG CGTATATGTA TGATGTATAT
TCTACTTCAC CTATATCTCG TAATGAAGTT TATGTAGATG CACACTCTGG AGATATTTTA
TTTAAGAACC CAATTATAAA ACACGCAGAC CGTTTAATTT CTAATAATGA AATTTCAGCA
AAGGCAAAAG AACTTGAGAA TACGTTAAAC TCTGCGCTTG TTGAAGGAAC TGCAGATACT
CGTTACAGTG GTACAAGGCC AATTGAAACA ACTCCTGAAG GATCTGAATT TATACTGTTA
GATACAACAA GAGGTGACGG AATTGTTACT TACAATTGCG AAGGTTTTAA TGCATATCAA
GATTTACATT TTTCTGATAA TGATAACGCT TGGACAGCTG CAGAATATGA TAACGATGAA
AAAGATAATG GTGCATTAGA TGCCCATTGG GGAGCAGAAG TAACCTATGA TTTCTGGCAA
GATATATTTA ACAGAAATAG TTTTGATGAT GAAGGAGCTC AAATTATAAG CTACGTACAT
TACGATGATG CAGACGAACC TTTGTTTTTA GATACAGGTT ATGATAACGC CTTTTGGAAT
GGATCTGTAA TGACATATGG AGACGGAAAT ACTTTTGATA TTCTTACAGC GGCAGATGTA
TGCGGTCACG AAATAGGTCA TGCTGTATGT ACATTTACTG CAGATCTAGC TTATCAAAAT
CAATCTGGAG GTATGAATGA AGGGTACTCA GACATTTGGG GTGCTTGTGT AGAGCATTTC
GGAAGAACAG GTTCTATAGA TGGTGCTATT GATCCTAATG TGTGGTTAAT AGGTGAAGAC
CTTAATTCAC CACCATTACG CTCTATGAGT GATCCTAATT CAAGAAATGA TCCAGATACT
TATTTAGGAG ATAACTGGGT ATCTACTGGA GATGAAGGTA CTTGTGTGCC AGATGCAACA
ACAAATGATT ATTGTGGTGT TCACACCAAT AGTGGTGTTT TAAATCACTG GTTTTATATC
TTGACAGAAG GTGCTGCAGA TACAAATAAT GCGCCAACAC CAGATACTTA TGATGTTGCA
GGAATAGGTA TGGTTAAATC TGCAGAGATT GCTTATTTGG CAGAACGTGA TTATTTAACT
CCAAATGCAA CATATTTTGA TGCTCGTAAC GCAACTATTG CTGTAGCTAG CTCTATATAT
TGCGCAAACA GTCCAGAAGT TATTTCTGTA ACTAATGCTT GGTATGCCGT AAACGTAGGC
GAAGAATTTG CAGAAGCAGC AGATGATGTG TCGTTAATGT CTATTACAAA CAATACAGAA
GTTGGGTGTG ACGCAGGTAC ATTTAGCCCT CAAATTGTAG TTTCTAATGG AGGTACAAAT
ATGTTGGGAG ATGTAGACAT TTCTTTTTCA GTAAATGGTG CAGCTACTAC AACAGTTACT
GAAACAGTTA ACTTGGGAAC TTGTGAGTCT ACTACTCTAA CTTTAGATAT TGGATCTTTA
ACTCGTGGCG CAAATGTTTT AAACGTAGAT GTAACTACAA CAAACGATGG TCGTCCAGAA
AACAATACAG GAACAGTATT AGTAGTTGTA AACGATGCTG CTGAAGTAGA TGTAGTAAAT
ACTTTTGATA CTGTTGAAGA GTCTTTAATC TCTTATAATG ATGGTGTTAT TTCAAGCTTA
TGGGAAAGAG GAGTAGCTCA AGGAACGCTT TTAAGTGATG CAGTTGCTGG AGGATCTGAT
GTTTACGGAA CTAATTTAGA TGGAAACCAT CCAGATGGTA CTATAGCTTA TCTTGTTTCA
CAATGTTATG ATTTATCTTC TGTAGAAAAT GCTATATTAA AATTTGATAT GGCTTTCGAT
CTTGAAGAAA ATTGGGATAT TATGTATATG CAGTATACTA CAGATGGTGG TTCAACTTGG
CAAACTTTAG GAACAGCATC AGATGCTAAT TGGTATAATA ACGATAGGCT TCCAGACGGT
ACAGATTGCT TTAATTGTAT TGGTTCTCAG TGGACAGGTG AAGGAGAAGA CGCACACTCA
GGTGGAGGTT TAAACGCTAC AATGCACGAA TATAGTTACA GCCTTAGTGC TTTTGATAAT
GATGGTAGTG CCGAAACTAG TATGGTGTTT AGGTTTGTAT TTCAGTCTGA CGCTGCTGTA
AATGAAGAAG GTGTAATAGT AGATAATTTT GTAGTTGAAG CAGACCAGAT TGTACTTGCT
ACTACAAATA ATGAGTTTAA AGGATTGTCT ATATATCCTA ACCCTACAAA TACTATTGTA
AATATTTCTG GACAAGATCT TAAAGATGCT AAAGTTTCAG TAATCGATTT AAGCGGAAGA
GTAATTTCTA ACAATGCGTC TGTATTTAAT GGTAATGTAT TGCAAGTTAA TATGGAGCAG
TTAGCTTCTG GTAGTTATTT CTTAGTTATT GAAAATGAAA ATTATAGATC TGTTAAGCAG
GTTATTAGAA AATAA
 
Protein sequence
MKLNYALTFV LALGTLVANA QDKNKLRDNV AHQTMMTVSG HQSQQEAIAN FAKSYELDEN 
STFQEYRTTT DKLGNTHSRY QQYFNGIKVQ FGVMIVHNKQ GSVSLINGEL YNPKTINTVP
SLSKEVGLQR AIEHTNANAY LWEDASQAAL MGYSKPVGEL VIFPDVNKGI VHLAYMYDVY
STSPISRNEV YVDAHSGDIL FKNPIIKHAD RLISNNEISA KAKELENTLN SALVEGTADT
RYSGTRPIET TPEGSEFILL DTTRGDGIVT YNCEGFNAYQ DLHFSDNDNA WTAAEYDNDE
KDNGALDAHW GAEVTYDFWQ DIFNRNSFDD EGAQIISYVH YDDADEPLFL DTGYDNAFWN
GSVMTYGDGN TFDILTAADV CGHEIGHAVC TFTADLAYQN QSGGMNEGYS DIWGACVEHF
GRTGSIDGAI DPNVWLIGED LNSPPLRSMS DPNSRNDPDT YLGDNWVSTG DEGTCVPDAT
TNDYCGVHTN SGVLNHWFYI LTEGAADTNN APTPDTYDVA GIGMVKSAEI AYLAERDYLT
PNATYFDARN ATIAVASSIY CANSPEVISV TNAWYAVNVG EEFAEAADDV SLMSITNNTE
VGCDAGTFSP QIVVSNGGTN MLGDVDISFS VNGAATTTVT ETVNLGTCES TTLTLDIGSL
TRGANVLNVD VTTTNDGRPE NNTGTVLVVV NDAAEVDVVN TFDTVEESLI SYNDGVISSL
WERGVAQGTL LSDAVAGGSD VYGTNLDGNH PDGTIAYLVS QCYDLSSVEN AILKFDMAFD
LEENWDIMYM QYTTDGGSTW QTLGTASDAN WYNNDRLPDG TDCFNCIGSQ WTGEGEDAHS
GGGLNATMHE YSYSLSAFDN DGSAETSMVF RFVFQSDAAV NEEGVIVDNF VVEADQIVLA
TTNNEFKGLS IYPNPTNTIV NISGQDLKDA KVSVIDLSGR VISNNASVFN GNVLQVNMEQ
LASGSYFLVI ENENYRSVKQ VIRK
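
The relationship between the gene sequence and the protein sequence above can be sketched as a plain codon-by-codon translation. This is a minimal illustration, not the tool used to produce the record: `translate` and `CODON_TABLE` are hypothetical names, and the amino acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position, third base
# varying fastest), paired with the standard-code amino acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAAACTAA ATTACGCCTT AACTTTTGTT"))  # MKLNYALTFV
# Last 15 bases of the gene sequence above, ending in the TAA stop codon.
print(translate("GTTATTAGAA AATAA"))  # VIRK
```

The two outputs match the first ten and last four residues of the listed protein sequence.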