Gene CA2559_04000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCA2559_04000 
Symbol 
ID9296288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCroceibacter atlanticus HTCC2559 
KingdomBacteria 
Replicon accessionNC_014230 
Strand
Start bp908633 
End bp910240 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content35% 
IMG OID 
Productputative carboxy-terminal protease 
Protein accessionYP_003715564 
Protein GI298207385 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAATTTT ACAAAAAATC TTATCTCCCA CTTTTTTTAG GCATAGCCTG TGCTTTTGGA 
ATATTTCTAG GCGCAAAGCT TAACTTCTCT AACGGAAGCG AAAGCCTTTT TGCCTCTAAC
CCTAAAAAAG AAAAACTAAA CAGACTTATA GATTATATAG ACTATGAGTA TGTAGACCAA
GTTAATACAG ATAGCATTGT AGATGTTACT GTAGATGGCA TCTTAGAAAA TTTAGACCCA
CACTCAACAT ATATACCTCA AAGCGAATAC AAAGCAGTTG CCGAAAATAT GAAAGGAGAC
TTTGTAGGGA TTGGTGTTAG TTTTTACACC ATAAGAGATA CAATTTCTGT TATAAATACT
ATTGAAGGTG GCCCAAGTGA AAAAAGTGGT ATTGTTGGCG GAGACAGAAT ACTTTATGCA
GACAATATAA AGCTATTTGG AAATAATGTA ACTAACGATT CTCTTTCAAA TTTTCTAAAA
GGGAAAAGAA ATACAAATGT AACGCTTACG GTTTACAGGC CTAGTGAAGA CAAAACCTTC
AAAACCACAA TTAAGCGAGG TGATGTTCCC CTAAAAAGTG TTGATGCCTT TTATATGCTA
ACAGATGAGT TGGGATATAT AAAAATGAAC CGCTTTGCAG AAAGCACCTA TACAGAATTT
AAGCAGGCAC TTACAACCCT ACAACAACAA GGCGCCACAG AACTGGCTAT AGATTTACGC
GGAAATCCTG GAGGCTATAT TGGTCAAACC ACTAAAATTA TAGATGAATT TTTAGAAGAC
GACAAGCTTA TCCTTTTCAC AAAAAACAAA AAGGGCAAGG TAGAAAACAC CTACGCTACA
AGAAGCGGAA ACTTTGAAGA TGGTGGTATT TATGTGCTTA TTGATGAAAC ATCTGCGTCT
GCATCAGAAA TTATAGCTGG AGCCATACAA GACAACGATA GAGGATTAAT TATAGGACGC
AGGTCTTATG GTAAAGGATT AGTGCAACGC GAAATGGCAC TAGGAGATGG TAGCGCTGTA
CGCCTTACCA TTGCACGTTA TTACACGCCT ACTGGAAGAA GCATCCAAAA ACCTTATGAA
AATGGGAACG ACGCTTACTT TAGTGATTAC TTAAACAGAT ATAAAAATGG CGAGCTTATT
AGCCAAGACA GTATAACAGT TGCAGATTCT TTAAAGTTTA GAACACCAAA AGGTAAAATT
GTTTATGGTG GCGGCGGTAT TATACCAGAT ATATTTATAC CTAAAGACAC CTCATATGAA
AGTGAAAGTA TTCGTTTTTT ATTACGAAGC GGCTTTATGA ACAGGTTTGT GTTTAATCTT
TTAGAAAAAA ACAGAGACTT TTATACGAGT TTAAGTTTTA ATGAATTTGT AGAGAAAGAA
CTTATTACAA ATGCTACAGC TCAAGAATTT ATAAACTATA CGGCAAACCA AGGATATAGG
CTACAAGTTA AAAATCATTT ACCTGATCTA AAACGTTACC TAAAAGCAAC AATGGCACAA
CAATTATATG GCAGTAATGC TTTTGAGAAA TTAGTAAATG AAGATGATGC GTTTATAAAG
AAGGTAATTA AGATATCTAC CGAAGATGCT AGTCGATTTT TAGACTAG
 
Protein sequence
MKFYKKSYLP LFLGIACAFG IFLGAKLNFS NGSESLFASN PKKEKLNRLI DYIDYEYVDQ 
VNTDSIVDVT VDGILENLDP HSTYIPQSEY KAVAENMKGD FVGIGVSFYT IRDTISVINT
IEGGPSEKSG IVGGDRILYA DNIKLFGNNV TNDSLSNFLK GKRNTNVTLT VYRPSEDKTF
KTTIKRGDVP LKSVDAFYML TDELGYIKMN RFAESTYTEF KQALTTLQQQ GATELAIDLR
GNPGGYIGQT TKIIDEFLED DKLILFTKNK KGKVENTYAT RSGNFEDGGI YVLIDETSAS
ASEIIAGAIQ DNDRGLIIGR RSYGKGLVQR EMALGDGSAV RLTIARYYTP TGRSIQKPYE
NGNDAYFSDY LNRYKNGELI SQDSITVADS LKFRTPKGKI VYGGGGIIPD IFIPKDTSYE
SESIRFLLRS GFMNRFVFNL LEKNRDFYTS LSFNEFVEKE LITNATAQEF INYTANQGYR
LQVKNHLPDL KRYLKATMAQ QLYGSNAFEK LVNEDDAFIK KVIKISTEDA SRFLD