Gene CA2559_02385 details


Gene Information

Locus tag: CA2559_02385
Symbol:
ID: 9295962
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Croceibacter atlanticus HTCC2559
Kingdom: Bacteria
Replicon accession: NC_014230
Strand:
Start bp: 564692
End bp: 566851
Gene Length: 2160 bp
Protein Length: 719 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: Prolyl endopeptidase
Protein accession: YP_003715243
Protein GI: 298207064
COG category:
COG ID:
TIGRFAM ID:
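
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes 1-based inclusive coordinates and a complete, unspliced CDS ending in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for a complete, unspliced CDS: codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(564692, 566851)
print(length)                           # 2160
print(expected_protein_length(length))  # 719
```

Both results agree with the Gene Length and Protein Length fields of this record.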


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACAAC TCATCCTTGT AACAATAACT GCAGCAACAA TATTTAGCTG TAAAACAGAG 
ACTAAAACCG ATAGAACCAT AGCAGTGACA TACCCCGAAA CAAAGAAAGT AGATACCGTA
GATGTATATT TTGGTAATGA GGTGAAAGAC CCATATCGCT GGTTAGAAGA TGATCGCAGT
AAAGAAACCG AAGATTGGGT GAAAGCTCAA AACCAAGCTA CATTTGGATA TTTAGACAAA
ATTCCTTTTA GAGAAGATCT TAAAAACAGA TTAACCGAAC TGTGGAATTA TGAAAAGTTG
GGATCACCCT TTAAAGAAGG TGAGTATACC TACTATTTTA AAAACAATGG GTTGCAAAAC
CAAAGTGTGA TTTATAGGTA TAAATCTACC GAAAGCCCTG AAAATGCTAA AGTATTCCTG
GATCCAAATA AGTTTAGTGA AGACGGTACA ACATCATTAG GAGGATTAAA CTTTTCTAAA
GATGGAAGTA AAGCAGCTTA TTCAATTTCT GAAGGCGGTA GTGATTGGAG AAAAGTAATT
GTTGTAGATG CAGAAACCTT GGAACGTGTT GAAGATACTT TACAGGATAT TAAATTTAGT
GGTGTGTCTT GGAACGTGAA TGAAGGATTT TATTATTCAA GTTATGACAA ACCTAAAGGC
AGTGAGTTGT CTGCAAAAAC AGACCAGCAT AAACTATATT ATCACAAGCT AGGAACCTCT
CAAAAAGAGG ATAAACTTAT TTTTGGAGGA ACACAAGAAG AAAAAAGAAG ATATGTTGGT
GGCAGTGTAA CAGAAGATGG TAAGTATTTA ATTGTTTCAG GAAGTGTATC AACCTCAGGA
AACGATTTAA GAATAAAAGA CCTTACCAAG CCAAATTCAG ATTTTAAAAC TATAATTTCT
GGCTACGAAA CAGATTCATA CGTTATAGAA AATGAAGGCA GTAAACTATA TATTGTAACA
AACCTAAATG CACCTAATAA AAAAATTGTG ACGGTTGATG CTGAAAACCC ATCACCAGAA
AATTGGGTAG ATTTCATTCC AGAAACAGAA CATGTGCTTA GTCCTAGTAA AGCTGGTGGC
TACTTCTTTG CAGAATATAT GGTAGATGCT GTAAGCGAAG TAAAACAATA CGATTATGCT
GGTAAATTAA TACGTGAAGT TAAACTTCCA GGAGTTGGAA CAGTTGGTGG CTTTGGTGCT
AAAAAAGAAG ATAAAGAACT GTATTTCTCT TTTACAAATT ATGTAACACC AGGCAGCATA
TATAAGTATG ATATTGAAGA TGGTAATTCA GAGCTATATG TAAAACCAGA AATAGATTTT
AATCCAGACC ATTATAAGAG TGAACAGGTG TTCTTTAACT CTAAAGATGG TACAAAAATA
CCCATGATTA TAACCTATAA AAAAGGGACA GAGCTTAATG GTAAGAACCC TACGATACTA
TATGGTTATG GAGGTTTCAA TATAAGTTTA ACACCAAGTT TTAGTATAGC AAACGCTGTG
TGGATGGAGC AAGGTGGAAT TTATGCAGTT CCTAATTTGC GCGGTGGTGG AGAATACGGT
AAAGCTTGGC ATGATGCTGG TACTAAACTA CAAAAGCAAA ATGTATTTAA TGACTTTATA
GCTGCGGCAG AATATTTAAT TGAGAAGAAC TACACATCAA AAGAATATTT GGCAATTAGA
GGCGGTTCAA ATGGTGGATT ATTAGTCGGA GCCACGATGA CACAACGACC AGATTTAATG
CAAGTAGCAT TGCCTGCAGT AGGCGTGATG GATATGTTAC GCTATCATAC CTTTACAGCA
GGTGCAGGTT GGGCATATGA TTATGGAACG GCAGAAGATT CCGATGAAAT GTTTCAATAC
CTAAAAGGAT ACTCGCCAGT ACACAATGTA AAAGAAGGTG TTTCTTATCC TGCTACAATG
GTAACTACTG GAGATCATGA TGATCGCGTA GTACCAGCGC ATAGTTTTAA GTATGCTGCA
GAGTTGCAAG ATAAACAAGC TGGAAATGCT CCTACATTAA TTAGAATTGA AACTAATGCT
GGCCATGGTG CAGGAACACC AGTAAGTAAA ACTATAGAGC AGTACGCAGA TATTTTTGGT
TTTACGCTTT ACAATATGGG TTATGATGAG TTGCCGGTAA AGAAACAATT TAAAGACTAA
 
Protein sequence
MKQLILVTIT AATIFSCKTE TKTDRTIAVT YPETKKVDTV DVYFGNEVKD PYRWLEDDRS 
KETEDWVKAQ NQATFGYLDK IPFREDLKNR LTELWNYEKL GSPFKEGEYT YYFKNNGLQN
QSVIYRYKST ESPENAKVFL DPNKFSEDGT TSLGGLNFSK DGSKAAYSIS EGGSDWRKVI
VVDAETLERV EDTLQDIKFS GVSWNVNEGF YYSSYDKPKG SELSAKTDQH KLYYHKLGTS
QKEDKLIFGG TQEEKRRYVG GSVTEDGKYL IVSGSVSTSG NDLRIKDLTK PNSDFKTIIS
GYETDSYVIE NEGSKLYIVT NLNAPNKKIV TVDAENPSPE NWVDFIPETE HVLSPSKAGG
YFFAEYMVDA VSEVKQYDYA GKLIREVKLP GVGTVGGFGA KKEDKELYFS FTNYVTPGSI
YKYDIEDGNS ELYVKPEIDF NPDHYKSEQV FFNSKDGTKI PMIITYKKGT ELNGKNPTIL
YGYGGFNISL TPSFSIANAV WMEQGGIYAV PNLRGGGEYG KAWHDAGTKL QKQNVFNDFI
AAAEYLIEKN YTSKEYLAIR GGSNGGLLVG ATMTQRPDLM QVALPAVGVM DMLRYHTFTA
GAGWAYDYGT AEDSDEMFQY LKGYSPVHNV KEGVSYPATM VTTGDHDDRV VPAHSFKYAA
ELQDKQAGNA PTLIRIETNA GHGAGTPVSK TIEQYADIFG FTLYNMGYDE LPVKKQFKD
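
The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before use. The following is a minimal sketch of how one might strip the grouping, compute GC content, and translate the gene sequence. It assumes the codon assignments of NCBI translation table 11, which match the standard code for elongation (the tables differ only in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
import itertools

# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments for
# translation table 11 are the same as for the standard table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGAAACAAC TCATCCTTGT AACAATAACT GCAGCAACAA TATTTAGCTG TAAAACAGAG"
print(translate(clean(first_line)))  # MKQLILVTITAATIFSCKTE
```

The translated fragment matches the first 20 residues of the protein sequence listed above. Running `gc_content` on the full cleaned gene sequence gives a value that can be compared against the GC content field.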