Gene CA2559_03565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCA2559_03565 
Symbol 
ID9296200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCroceibacter atlanticus HTCC2559 
KingdomBacteria 
Replicon accessionNC_014230 
Strand
Start bp814075 
End bp817539 
Gene Length3465 bp 
Protein Length1154 aa 
Translation table11 
GC content38% 
IMG OID 
Productthermolysin 
Protein accessionYP_003715477 
Protein GI298207298 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAATT TTTATGGAAG TGTATTTTTC GCTCTTTTAT TCTGTGCTCC AATGCTACAA 
GCACAAACAG ATCGAAGAGA CGAAAAACGC AATGTAAAGC AACCGCCTAA ACTAATTGTT
TTAGACGATA CTCAAAGAAA ATCATCTAGC CCTCAAGATG TACTAGGACA AGTTTATAAC
CTAGGCCAAA ACAATCAACT TACTCCTCGA AAGCAATCTA CAGATAATCT CGGATTTTCA
CAGCAAGAAT TTCAACAACG CCATCAAGGC ATTCCTGTTG AGTTTGCAAT TTCAAAACTA
CATTCTAAAC AAGGTATCCC ACAAGCAGTA AGTGGTGAGT ATTATCCTAT AAAAGATTTA
GATGTTACAA CAACATTAAC TAACCAACAA GCATTAATGC GCGCTGTTAA CCACATAGGT
GCGCAACATT ATTTATGGGA ATATCCAGAT GCAGCTGCAG AAATGGATGG TTACCAAAAA
CCTACAGGTG ACCTAGTAAT TTTACCAATG TATAATGGTG ATGAAATTTC CACTTACAAG
TTAGCCTACA AATTTGATAT TTATGCAACG TATCCAATTA GTCGTGGAGA TCTTTATATA
GATGCTAAAA ATGGAGACGC ATTATTTTAC AATGCAACTA TTAAACACGC AAACTCTTTT
GGTCACGTAG GTGAGCCAAT GGTAACTGTA TCTCAATTTG AAGACGATAC AACGTATGAT
AATTATACAA GCATAGCTAT GATGGCTAGT GGTACTGCTG CTACACGTTA TAGTGGTTCT
CGTACTATTG AAACTCAACT TACTGGTGGT AGCTATAGGT TAGCAGATAC TGGTAGAGAT
GTTTATACTC GTGATGCTAA AAATCAGGCG CCAGGAGGCA CGTATCCTTA CATTAATAAT
TATGATGAGT TTACAGACAA TGATAATAAT TGGACAACTG CAGAGCATAG CGCAAATAAA
GATAATGCGG CTTTAGATGC TCATTGGGGT GCAATGGAAA CGTATGATTA CTGGCAACAA
GTTCACGGTC GTGATAGTTA CAACGGTAGT GGTGCACAAA TTAGAAGTTA TGTTCACGTA
GATAACAATT ATGATAACGC ATTCTGGAAT GGTTCTGTAA TGTCTTACGG TGATGGTTCT
TCGAATGGTA ATGAAGGAAA TGGATTTTTT GATGCGTTAA CAAGTATAGA TGTTGCATCT
CACGAAATAG GACATGCTGT AACAACGTTT ACTGCAAATC TTGCATACCA AAGAGAATCT
GGTGGTCTTA ATGAAGGCTT CTCAGATATT TGGGGTGCAG CAGTAGAGCA TTTTGCAAAA
GGAAATGGAA GTGATACTAA TCCAACTGAT GAAGTTTGGT TAATTGGAGA TGAAATTGAT
AGACGAAGCG GCTCTGCAGC TCTACGCTCT ATGAGTAATC CAACATCATT AGGACAACCA
GATACTTATG GAGGACAATT CTGGCAAAAC CCTAATTGTG GAACACCAAC GCAATCTAAT
GACTATTGTG GAGTACATAC AAACTCTGGT GTATTAAACT ACTGGTTTTA CTTATTAGTT
GAAGGTGGCA ATGGTACAAA TGACGTTGGC GATGTGTTTT CAGTAAGTGG AATTGGTATG
GATAAATCTG CTAAAATTGC CTACAGAACT TTAAATAACT ATTTATCTGC AAACTCTACG
TTTGCAAATG CAAGAGCTGG CGCAATACAA GCGGCTAAAG ATCTTTATGG AGCTGGTGGC
GCAGAAGAGC AGGCAGTAAC AAATGCTTGG CATGCCGTTA ATGTAGGTGA TGCTTTTGGT
GGAGGAAATG GTGGCTCTAA CTATTGTGCC TCTAATGGTA ACAGTGTTGC AGATGAGTAT
ATCTCTAGAG TACAATTGGC AGATATTAAT AATACTTCTG GTGCCGGAAG CGGTGGATAC
CAAAATCATA CAGCTGTGGA AACAGACTTG GCAAAAGGTG ATGTATATAC AATTACAATT
ACACCTACCT GGACAGGAAC TGTATATTCT GAGGGCTATG CCGTTTGGAT AGATTATAAC
CAAGATGGTG ATTTTTCTGA TTCTGGAGAA TTAGTTACAA GTGTTGCAGC TACACAAAAT
ACACCTGTAA GTGGTAGCTT TACAGTGCCA ACCAATGCTT CAGATGGAGC TACTCGTATG
AGAGTTTCTA TGAAATATAA CGGTGTGCCT AGCGCTTGTG AATCATTTAG CTATGGTGAA
GTAGAAGATT ATACAGTTAA TATTGGAAGC TCTGCTGCAG ATACTCAAGC ACCTAGTGTA
CCAACAAATT TATCTGCTTC TAACATCACA GAAACTACAG TAGATTTATC TTGGAATGCT
TCAAACGATA ATGTTGGAGT AACAGGATAT GATGTTTATC AAGGAACTGC TTTATTAGGA
ACTACAGCTA ATACATCTGC TCAAATTACC GGATTATCTT CTGGAACTAA TTATACATTT
AATGTTCGCG CAAAAGACGA CGCAGGAAAT GTATCTGGAG CTAGTAATAC TGTGTCTGTA
ACTACAGACT CACCAGCATC TGGTGGTGGT TGTGTAAATG GTATTTCTTC ATTCCCTTAT
TCAGAAGGTT ATGAAAGTAA TTTAGGAGCT TGGACGCAAT CTACTTCAGA CGATATAAAC
TGGACTAGAG ACGCTAGTGG TACACCATCT AGCAATACAG GACCATCAAG TGCGGTTGAA
GGTAGTTTTT ATGTTTTTGT TGAAGCTTCT GGAAACGGTA CAGGCTATCC TAACAAACAA
GCAATTTTAA ATTCACCTTG TATAGATTTA TCTGCTACTG CACAACCAAA TTTTACATTT
AAGTATCATA TGTATGGTGC ATCAGATATG GGAAGTTTAA AAGTAGATGT TAGTGATAAT
GATGGAGCAA CTTGGACAAC GTTATTCTCC GAAACAGGAA ATAAAGGAAA TGCTTGGCAA
TCTGCAACTG TAGATCTTTC AGCTTATAAT GGATCATCTA TTAGACTAAG GTTTAATAGA
ACTACAGGCT CTACGTGGCA AGCAGATATC GCTATTGATG ATGTGAGAGT TATAGATGGT
ACAGCAGACA TTTGCGCAGG TGTTGCACCT TATAGCAGTT CTCAATCTTA TAGTACTGGA
GATCGTGTTA CGTATCAAGG TAATTTATTT GAAAGAACAG CTAGTGGTTG GACAAACTTG
GGAGCTTGTG GATCTTCTTT TGGAATCACA AACTTAGTGC CTCAGGCGCC ACCAGTTAAC
TCAACATTTA ATATTTACCC TAACCCTGTA AAAGGAACAT CTTTACAAGT AGACTTTGTG
TCTCAAAAAG AAGCAACATT TACAATAGTT AATATGTTAG GACAGCAAGT TGCAAAAGGA
AACTTGAGCA ACACGTTAGA AGTAGGAAAT CTAGAAAGTG GTATCTATTT ACTAAATGCT
ACTATAGATG GACAAACTAT AACAAAACGC TTTATTAAAG AGTAA
 
Protein sequence
MRNFYGSVFF ALLFCAPMLQ AQTDRRDEKR NVKQPPKLIV LDDTQRKSSS PQDVLGQVYN 
LGQNNQLTPR KQSTDNLGFS QQEFQQRHQG IPVEFAISKL HSKQGIPQAV SGEYYPIKDL
DVTTTLTNQQ ALMRAVNHIG AQHYLWEYPD AAAEMDGYQK PTGDLVILPM YNGDEISTYK
LAYKFDIYAT YPISRGDLYI DAKNGDALFY NATIKHANSF GHVGEPMVTV SQFEDDTTYD
NYTSIAMMAS GTAATRYSGS RTIETQLTGG SYRLADTGRD VYTRDAKNQA PGGTYPYINN
YDEFTDNDNN WTTAEHSANK DNAALDAHWG AMETYDYWQQ VHGRDSYNGS GAQIRSYVHV
DNNYDNAFWN GSVMSYGDGS SNGNEGNGFF DALTSIDVAS HEIGHAVTTF TANLAYQRES
GGLNEGFSDI WGAAVEHFAK GNGSDTNPTD EVWLIGDEID RRSGSAALRS MSNPTSLGQP
DTYGGQFWQN PNCGTPTQSN DYCGVHTNSG VLNYWFYLLV EGGNGTNDVG DVFSVSGIGM
DKSAKIAYRT LNNYLSANST FANARAGAIQ AAKDLYGAGG AEEQAVTNAW HAVNVGDAFG
GGNGGSNYCA SNGNSVADEY ISRVQLADIN NTSGAGSGGY QNHTAVETDL AKGDVYTITI
TPTWTGTVYS EGYAVWIDYN QDGDFSDSGE LVTSVAATQN TPVSGSFTVP TNASDGATRM
RVSMKYNGVP SACESFSYGE VEDYTVNIGS SAADTQAPSV PTNLSASNIT ETTVDLSWNA
SNDNVGVTGY DVYQGTALLG TTANTSAQIT GLSSGTNYTF NVRAKDDAGN VSGASNTVSV
TTDSPASGGG CVNGISSFPY SEGYESNLGA WTQSTSDDIN WTRDASGTPS SNTGPSSAVE
GSFYVFVEAS GNGTGYPNKQ AILNSPCIDL SATAQPNFTF KYHMYGASDM GSLKVDVSDN
DGATWTTLFS ETGNKGNAWQ SATVDLSAYN GSSIRLRFNR TTGSTWQADI AIDDVRVIDG
TADICAGVAP YSSSQSYSTG DRVTYQGNLF ERTASGWTNL GACGSSFGIT NLVPQAPPVN
STFNIYPNPV KGTSLQVDFV SQKEATFTIV NMLGQQVAKG NLSNTLEVGN LESGIYLLNA
TIDGQTITKR FIKE